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List of potentially matching sequences 



Send selected sequences to J Clustal W (multiple alignment) 
Select up to... 




n Include query sequence 

Db AC Description Score E-value 

□ sp PI 9 196 INVA_YEREN ^^^^ {^^si^a enterocolitica] 16 39 0.0 
[j tr Q56889 invasin [invA] [Yersin i~a~~'eh terocolitica] 1613 0.0 
D sp P119_22 INVA_YERPS Invasin [Yersinia pseudotuberculosis] 726 0.0 

□ tr 05693 7 Invasin [inv] [Yersinia pestis] 721 0.0- 
P tr Q8ZA73 Putative invasin (Hypothetical) [YP03944] [Yersinia pe . . . 428 e-118 

□ tr Q74QZ6 Putativein vasin JYP33061 [Yersinia pestis] 428 e-118 

TT tr Q6KC27 IntimirTTambda ^feaej) [Escherichia coli] 342 2e-92 



□ tr Q8KRK8 Intimin lambda [Escherichia coli] 340 6e-92 

□ tr Q8VL95 Theta ij^^^^ [eae] [Escherichia coli] 336 le-90 
n tr Q8VL00 Theta intimin [Escherichia coli] 335 2e-90 

sp O31Q00 EAE_EC011 Intimin (Attaching and effacing protein) (Ea... 333 6e-90 

O <tr Q6EV99 Intimln sic^ma [eael [Escherichia coli] ; 332 2e-89_ 

D tr Q93FK0 Intimin C^itro bact^r rodentiuml 23_--_Hr**S r ' 



C tr Q93UI3 Intimin type gamma [eae] [Escherichia coli] 331 4e-89 

□ tr Q842D8 Intimin [eae] [Escherichia coli] 330 5e-89 
J~J tr P77067 Beta intimin (Eae) [eaeA] [Escherichia coli] 330 5e-89 

P tr Q9RCH1 Intimin [eaeA] [Escherichia coli] 330 7e-89 

□ tr Q9R8R5 Intimin [eae] [Escherichia coli] 330 7e-89 

□ tr Q9S6R2 Intimin [eaeA] [Escherichia coli] 330 7e-89 



U2 



tr Q6KC44 Intimin mu [eae] [Escherichia coli] : 12LQ7e-89 



tr Q4/018 Intimin [eaeA] [Escherichia coli] 330 9e-89 

[j tr Q8X6G3 Putative adhesin [eaeH] . [Escherichia coli 0157 :H7] 329 le-88 

tr Q93K93 Intimin-kappa [eae-kappa] [Escherichia coli] 329 le-88 

□ tr Q7AHB0 Putative invasin [ECs0336] [Escherichia coli 0157 :H7] 3J29 le-88 
D tr Q8FKK3 Putative adhesin [eaeH] [Escherichia coli 06] 329 le-88 
fj tr Q93K94 Intimin-jota [eae-jota] [Escherichia coli] 328 2e-88 
P tr Q9LBG6 Intimin [eae] [Escherichia coli] 328 2e-88 
fj tr Q6KC43 Intimin nu [eae] [Escherichia coli] 328 2e-88 

□ tr Q8KRL1 Intimin epsilon 2 [Escherichia coli] 327 4e-88 
n~ tr Q9EXN4 Intimin zeta [eae] [Escherichia coli] 326 le-87 
P tr Q8KRK9 Intimin beta 2 [eae] [Escherichia coli] 326 le-87 
fj sp Q075 91/ 1sAi^CITFR Intimin (Attaching and effacing protein) (Ea... 326 le-87 
H tr Q6QZWl^Eae^ (Fragment) [Escherichia coli] 326 le-87 
O tr Q6KC41 Intimin pi [eae] [Escherichia coli] 326 le-87 

□ tr Q6KC42 Intimin xi [eae] [Escherichia coli] 325 2e-87 
C tr Q8VL93 Zeta intimin [Escherichia coli] 325 2e-87 
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□ tr Q846P2 Intimin-zeta [eae] [Escherichia coli] 325 3e-87 
n sp P43261 EAE__EC057 Intimin (Attaching and effacing protein) (Ea... 324 4e-87 
P tr Q9RGP3 Intimin type epsilon [eae] [Escherichia coli] 324 4e-87 

□ tr Q9R8B5 Gamma intimin [eae] [Escherichia coli] 324 4e-87 

□ tr Q9R8B4 Gamma intimin [eae] [Escherichia coli] 324 4e-87 

□ tr Q8RSS9 Intimin eta [eae] [Escherichia coli] 323 le-86 
[J tr Q84FQ2 Intimin epsilon [Escherichia coli] 323 le-86 
HI tr Q93UI4 Intimin type beta [eae] [Escherichia coli] 322 2e-86 
D tr Q8RNT8 Intimine type epsilon [eae] [Escherichia coli] 322 2e-86 
P tr Q8KRL0 Intimin alpha 2 [eae] [Escherichia coli] 321 4e-86 

□ tr 006896 Intimin [eae] [Escherichia coli] 318 2e-85 
P tr 052620 Intimin [eaeA] [Escherichia coli] 318 3e-85 

□ sp P19809 EAE_EC027 Intimin (Attaching and effacing protein) (Ea... 317 6e-85 
P tr Q9F609 Intimin type epsilon (Fragment) [eaeA] [Escherichia coli] 316 le-84 
P tr Q8KRL2 Intimin jota 2 [Escherichia coli] 316 le-84 
P tr Q6KAV7 Intimin rho [eae] [Escherichia coli] 311 2e-83 
P tr Q7N599 Similarities with putative adhesin [plu2057] [Photorha. . . 310 9e-83 
P tr Q9EYM6 Intimin type epsilon (Fragment) [eaeA] [Escherichia coli] 310 9e-83 
p tr Q8XB95 Putative invasin [z5932] [Escherichia coli 0157 :H7] 304: 4e-81 
lj tr Q7A8L6 Putative invasin [ECs5290] [Escherichia coli 0157 :H7] 298 4e-79 
P tr Q8D083 Putative adhesin [y2605] [Yersinia pestis] 296 le-78 
P sp P76347 YEEJ_ECOLI Hypothetical protein yeeJ [yeeJ] [Escherich. . . 293 le-77 
P sp Q8X8V7 YEEJ„ EC057 Hypothetical protein yeeJ [z3135] [Escheric... 289 2e-76 
P tr Q7CQ17 (sinH>)[sinH] [Salmonella typhimurium] 245 3e-63 
I J tr Q9XCI9 *SinH [sinH] [Salmonella typhimurium] 245 3e-63 
P tr Q8FF61 SinH homolog [c3031] [Escherichia coli 06] 235 3e-60 
P tr Q9L574 Invasin-like SivH [sivH] [Salmonella typhi] 233 9e-60 
P tr Q7X2C2 Aecl precursor [aecl] [Escherichia coli] 233 le-59 
P sp P36943 EAEH_ECOLI Attaching and effacing protein homolog prec . . . 209 le-52 
P sp P39165 YCH0_EC0LI Hypothetical protein ychO [ychO] [Escherich... 199 2e-49 
P tr Q8XDD4 YchP protein [ychP] [Escherichia coli 0157 :H7] 198 3e-49 
P tr Q7AEU6 Putative factor [ECsl725] [Escherichia coli 0157 :H7] 198 3e-49 
P tr Q83RN8 Putative factor [ychP] [Shigella flexneri] 198 4e-49 
P tr Q7UCS2 Putative factor [ychP] [Shigella flexneri] 198 4e-49 
P tr Q8FHZ3 Hypothetical protein ychP [ychP] [Escherichia coli 06] 195 3e-48 
P tr Q93GR3 Putative adhesin [PSLT034] [Salmonella typhimurium] 190 8e-47 
P tr Q.8ZP33 Putative invasin [ychP] [Salmonella typhimurium] ■ 186 le-45 
P tr Q8Z7G3 Putative invasin [STY1284] [Salmonella typhi] 186 le-45 
P tr Q8ZP78 Homology to invasin C of Yersinia; intimin [STM1669] [... 158 5e-37 
P tr Q83T27 Invasin-like protein [tl573] [Salmonella typhi] 157 le-36 
P tr Q8Z795 Invasin-like protein [STY1395] [Salmonella typhi] 156 le-36 
P tr Q7WR47 Putative adhesin [BB0110] [Bordetella, bronchiseptica (... 85 5e-15 
P tr Q7W286 Putative adhesin [BPP0104] [Bordetella parapertussis] 85 6e-15 
P tr Q7WAX9 Putative outer membrane ligand binding protein [bipA] ... 70 le-10 
P tr Q9APE8 Putative outer membrane ligand binding protein [bipA] . . . 69 4e-10 
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□ 


tr 


Q7VZ27 


Putative outer membrane ligand binding protein [bipA] . . . 


69 


4e 


-10 


n 


tr 


Q7VR4 9 


Putative adhesin [eaeH] [Candidatus Blochmannia florid. . . 


67 


le 


-09 


□ 


tr 


Q52708 


Outer membrane protein A (Fragment) [ompA] [Rickettsia. . . 


53 


3e 


-05 


□ 


tr 


Q6CD35 


Similarity [YALI0C04136g] [Yarrowia lipolytica CLIB99] 


53 


3e 


-05 


n 


tr 


Q82XT8 


Hemolysin -type calcium-binding region :RTX N- terminal d. . . 


11 


le 


-04 


□ 


sp 


Q52657 


OMPA_RICCN Outer membrane protein A precursor (190 kDa. . . 


50 


2e 


-04 


n 


tr 


Q6CPZ4 


Kluyveromyces lactis strain NRRL Y-1140 chromosome E o. . . 


50 


2e 


-04 


n 


sp 


Q8TFG9 


YL61_SCHP0 Hypothetical serine/threonine-rich protein . . . 


47 


0. 


002 


□ 


tr 


Q9C105 


SPAPB1E7 . 04c protein [SPAPB1E7 . 04c] [Schizosaccharomyc . . . 


47 


0. 


002 


□ 


tr 


Q722C9 


Cell wall surface anchor family protein [LMOf2365 0805... 


46 


0. 


003 


r 


tr 


Q76I98 


Bacterial immunogloblin [is2-2] [Leptospira interrogan. . . 


46 


0. 


003 


n 


tr 


Q7U7M8 


Hypothetical [SYNW0953] [Synechococcus sp . (strain WH8 . . . 


45 


0. 


004 


□ 


tr 


Q8FGD9 Hypothetical protein c2439 [c2439] [Escherichia coli 06] 


45 


0. 


006 




tr 


Q6KB38 


Intimin zeta (Fragment) [eae] [Escherichia" coli] 


45 


0. 


006 


□ 


tr 


Q6AHX5 


Hypothetical protein DKFZp686O19206 [DKFZp686O19206] [... 


45 


0. 


006 


□ 


sp 


P15921 


OMPA_RICRI Outer membrane protein A precursor (190 kDa. . . 


45 


0. 


007 


r 


tr 


Q8VW0 


Intimin-lambda (Fragment) [eae-lambda] [Escherichia coli] 


11 


0. 


010 


n 


tr 


Q8VLL9 


Intimin (Fragment) [eae] [Escherichia coli] 


44 


0. 


010 


□ 


tr 


Q6KBV3 


Intimin zeta (Fragment) (Fragment) [eae] [Escherichia . . . 


44 


0. 


010 
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Alignments 



sp 



P19196 
INVA YEREN 




Invasin [Yersinia enterocolitica] 



835 AA 
align 



Score = 1639 bits (4243), Expect = 0.0 

Identities = 819/835 (98%), Positives = 819/835 (98%) 

Query : 1 MYSFFNTLTVTKXXXXXXXXXXXXXXXXTYGFSQQHYFNSEALENPAEHNEAFNKIISTG 60 



Sbjct : 1 M YS F FNTLTVTKI I S RL I LS I GL I FG I FTYGFS QQHYFNS E ALENPAEHNE AFNKI I S TG 60 

Query: 61 TSLAVSGNASNITRSMWDAANQEVKHWLNRFGTTQVimJFDKKFSLKESSLDWLLPWY^ 12 0 

TSLAVSGNASNITRSMVNDAANQEVKHWLNRFGTTQVNVNFDKKFSLKESSLDWLLPWYD 
Sbjct: 61 TSLAVSGNASNITRSMVNDAANQEVKHWLNRFGTTQVNW 12 0 

Query: 121 SASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQSWMYGFNTSYDNDMTGHNHRIGVGAEAW 180 

SASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQSWMYGFNTSYDNDMTGHNHRIGVGAEAW 
Sbjct: 121 SASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQSWMYGFNTSYDNDMTGHNHRIGVGAEAW 180 

Query: 181 TDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPALPQLGGKLKYEQYRGE 240 

TDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPALPQLGGKLKYEQYRGE 
Sbjct: 181 TDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPALPQLGGKLKYEQYRGE 240 

Query: 241 RVALFGKDNLQSNP YAVTTGL I YTPI PF ITLGVDQRMGKSRQHE I QWNLQMDYRLGES FR 300 

RVAL FGKDNLQSNP YAVTTGL I YTP I PF I TLGVDQRMGKSRQHE I QWNLQMD YRLGE S FR 
Sbjct: 241 RVALFGKDNLQSNP YAVTTGL I YTP I PF I TLGVDQRMGKSRQHE I QWNLQMDYRLGES FR 300 

Query: 3 01 S QF S P AWAGTRLL AE S R YNL VERNPNI VLE YQKQNT I KLAFS PAVL SGL PGQ VYS VS AQ 360 

SQFS PAWAGTRLLAESRYNLVERNPNI VLEYQKQNTI KLAFSPAVLSGLPGQVYSVSAQ 
Sbjct: 3 01 SQFSPAWAGTRLLAESRYNLVERNPNIVLEYQKQNTIKLAFSPAVLSGLPGQVYSVSAQ 360 

Query: 361 IQSQSALQRILWNDAQWVAAGGKLIPVSATDYNWLPPYKPMAPASRTVGKTGESEAAVN 420 

I QSQS ALQRI LWNDAQWVAAGGKL I P VS ATD YNVVLPP YKPMAPASRTVGKTGES E AAVN 
Sbjct: 361 IQSQSALQRILWNDAQWVAAGGKLIPVSATDYNWLPPYKPMAPASRTVGKTGESEAAVN 420 

Query: 421 TYTLSATAIDNHGNSSNPATLTVIVQQPQFVITSEVTDDGALADGRTPITVKFTVTNIDS 480 

TYTLSATAIDNHGNSSNPATLTVIVQQPQFVITSEVTDDGALADGRTPITVKFTVTNIDS 
Sbjct: 421 TYTLSATAIDNHGNSSNPATLTVIVQQPQFVITSEVTDDGALADGRTPITVKFTVTNIDS 480 

Query: 481 TPVAEQEGVITTSNGALPSKVTKKTDAQGVISIALTSFTVGVSWTLDIQGQQATVDVRF 54 0 

TPVAEQEGVITTSNGALPSKVTKKTDAQGVISIALTSFTVGVSVVTLDIQGQQATVDVRF 
Sbjct: 481 TPVAEQEGVITTSNGALPSKVTKKTDAQGVISIALTSFTVGVSWTLDIQGQQATVDVRF 54 0 

Query: 541 AVLPPDVTNSSFNVSPSDIVADGSMQSILTFVPRNKNNEFVSGITDLEFIQSGVPVTISP 600 

AVL PPDVTNSS FNVS PSD I VADGSMQS I LTFVPRNKNNEFVSGI TDLEFIQSGVPVTI S P 
Sbjct: 541 AVLPPD VTNSSFNVSPSDI VADGSMQS ILTFVPRNKNNEFVSGITDLEFIQSGVPVTISP 600 

Query: 601 VTENADNYTASWGNSVGDVDITPQVGGESLDLLQKRITLYPVPKITGINVNGEQFATDK 660 

VTENADNYTASWGNSVGDVDITPQVGGESLDLLQKRITLYPVPKITGINVNGEQFATDK 
Sbjct: 601 VTENADNYTASWGNSVGDVDITPQVGGESLDLLQKRITLYPVPKITGINVNGEQFATDK 660 

Query: 661 GFPKTTFNKATFQLVTVINDDVANNTQYDWTSSYAASAPVI)NQGKWIAYKTYGSTVTVTAK 72 0 

G FP KTTFN KAT FQL VMNDD VANNTQ YDWTS S YAAS APVDNQGKVN I AYKTYGS T VTVTAK 
Sbjct: 661 GFPKTTFNKATFQLV1VINDDVANNTQYDWTSSYAASAPVIDNQGKVNIAYKTYGSTVTVTAK 720 

Query: 721 S KKFPS YTATYQFKPNLWVFSGTMSLQS S VEASRNCQRTDFTAL I ES ARASNGSRS PDGT 780 

S KKFPS YTATYQFKPNLWVFSGTMSLQS SVEASRNCQRTDFTAL I ES ARASNGSRS PDGT 



MYS FFNTLTVTK 



TYGFSQQHYFNSEALENPAEHNEAFNKI I STG 



http://us.expasy.org/cgi-bin/blast.pl 



9/14/04 



ExPASy BLAST2 Interface 



Page 2 of 13 



Sbjct: 721 SKKFPSYTATYQFKPNLWVFSGTMSLQSSVEASRNCQRTDFTALIESARASNGSRSPDGT 780 

Query: 781 LWGEWGSLATYDSAEWPSGNYWTKKTSTDFVTMDMTTGDIPTSAATAYPLCAEPQ 835 

LWGEWGSLATYDSAEWPSGNYWTKKTSTDFVTMDMTTGDIPTSAATAYPLCAEPQ 
Sbjct: 781 LWGEWGSLATYDSAEWPSGNYWTKKTSTDFVTMDMTTGDIPTSAATAYPLCAEPQ 835 



tr Q56889 Invasin [invA] [Yersinia enterocolitica] 835 AA 

align 

Score = 1613 bits (4176), Expect = 0.0 

Identities = 805/835 (96%), Positives = 811/835 (96%) 

Query : 1 MYSFFNTLTVTKXXXXXXXXXXXXXXXXTYGFSQQHYFNSEALENPAEHNEAFNKIISTG 60 

MYS FFNTLTVTK TYGFSQQ+YFNSEALENPAEHNEAFNKIISTG 
Sbjct: 1 MYS FFNTLTVTKI I SRL I L S I GL I FG I FT YGFS QQNYFNS E ALENP AEHNE AFNKI I S TG 60 

Query: 61 TSLAVSGNASNITRSIWNDAANQEVKHWLNRFGTTQWWFDKKFSLKESSLDWLLPWYD 12 0 

TSLAVSGNASNITRSMWDAANQEVOT 
Sbjct : 61 TSLAVSGNASNITRSMWDAANQEVKHWLNRFGTTQV^mTFDKKFSLKESSLDWLLPWYD 12 0 

Query: 121 SASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQSWMYGFNTSYDNDMTGHNHRIGVGAEAW 180 

SASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQSWMYGFNT YDMDMTGHNHR I GVG AE AW 
Sbjct: 121 S AS Y VF F S QLG I RNKD S RNTLN I GAG VRT F QQ S WM YG FNT F YDNDMTGHNHR I G VGAE AW 180 

Query: 181 TDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPALPQLGGKLKYEQYRGE 24 0 

TDYLQLSANGYFRLNGWHQSRDF DYNERPASGGDIHVKAYLPALPQLGGKLKYEQYRGE 
Sbjct: 181 TDYLQLSANGYFRLNGWHQSRDFVDYNERPASGGDIHVKAYLPALPQLGGKLKYEQYRGE 240 

Query: 241 RVAL FGKDNL QSNP YAVTTGL I YT P I P F I TLG VDQRMGKS RQHE I QWNLQMD YRLGE S FR 300 

RVALFGKDNLQSNPYAVTTGLIYTPIPFITLGVDQRMGKSRQHEI'QWNLQMDYRLGESFR 
Sbjct: 241 RVALFGKDNLQSNP YAVTTGL I YTP I P F I TLGVDQRMGKSRQHE I QWNLQMD YRLGE S FR 300 

Query: 301 SQFSPAWAGTRLLAESRYNLVERNPNIVLEYQKQNTIKLAFSPAVLSGLPGQVYSVSAQ 360 

SQFSPAWAGTRLLAESRYNLVERNPNIVLEYQKQNTIKLAFSPAVLSGLPGQVYSVSAQ 
Sbjct: 301 SQFSPAWAGTRLLAESRYNLVERNPNIVLEYQKQNTIKLAFSPAVLSGLPGQVYSVSAQ 360 

Query: 361 IQSQSALQRILWNDAQWVAAGGKLIPVSATDYNWLPPYKPMAPASRTVGKTGESEAAVN 420 

IQSQSALQRILW+DAQW-fA GGKLIPVSAT+YNWLPPYKPMAPASRTVGKTGESEAAVN 
Sbjct: 3 61 IQSQSALQRILWDDAQWIAVGGKLIPVSATNYNWLPPYKPMAPASRTVGKTGESEAAVN 42 0 

Query: 421 TYTLSATAIDNHGNSSNPATLTVIVQQPQFVITSEVTDDGALADGRTPITVKFTVTNIDS 4 80 

TYTLSATAIDNHGMSSNPATLTVIVQQPQFVITSEVTDDGALADGRTPITVKFTVTNID 
Sbjct: 421 TYTLSATAIDNHGNSSNPATLTVIVQQPQFVITSEVTDDGALADGRTPITVKFTVTNIDG 480 

Query: 481 TPVAEQEGVITTSNGALPSKVTKKTDAQGVISIALTSFTVGVSWTLDIQGQQATVDVRF 54 0 

TPVAEQEGVI TTSNGALPSKVTKKTDAQGVI S I ALTSFTVG SWTLDIQGQQATVDVRF 
Sbjct: 481 TPVAEQEGVITTSNGALPSKVTKKTDAQGVISIALTSFTVGGSWTLDIQGQQATVDVRF 540 

Query: 541 AVLPPDVTNSSFNVSPSDIVADGSMQSILTFVPRNKNNEFVSGITDLEFIQSGVPVTISP 600 

AVLPPDVTNSSFNVSPSDIVADGSMQSILTFVPRNI<CIsINEFVSGITDLEFIQSGVPVTISP 
Sbjct: 541 AVLPPDVTNSSFNVSPSDIVADGSMQSILTFVPRNKNNEFVSGITDLEFIQSGVPVTISP 600 

Query: 601 VTENADNYTASWGNSVGDVDITPQVGGESLDLLQKRITLYPVPKITGINVNGEQFATDK 660 

VTENADMYTASWGMSVGDVDITPQVGGESLDLLQKRITLYP+PKIT I VNGEQFATDK 
Sbjct: 601 VTEMADNYTASWGNSVGDVDITPQVGGESLDLLQKRITLYPIPKITSIAVNGEQFATDK 660 
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Query: 661 GFPKTTFNKATFQLVMNDDVAJMTQYDWTSSYAASAPVDNQGKVNIAYKTYGSTVTVTAK 72 0 

GFPKTTFNKATFQLVMNDDVANNTQYDWTSSYAASAPVDNQGKVNIAYKTYGSTVTVTAK 
Sbjct: 661 GFPKTTFNKATFQLVMNDDVANNTQYDWTSSYAASAPVDNQGKVNIAYKTYGSTVTVTAK 720 

Query: 721 SKKFPSYTATYQFKPNLWVFSGTMSLQSSVEASRNCQRTDFTALIESARASNGSRSPDGT 780 

SKKFPSYTATYQFKPNLWVFSGTMSLQSS+EASRNCQRTDFTALIESARASNGSRSPDGT 
Sbjct: 721 S KKFPS YTATYQFKPNLWVFSGTMSLQSS I EASRNCQRTDFTAL I ES ARASNGSRS PDGT 780 

Query: 781 LWGEWGSLATYDSAEWPSGNYWTKKTSTDFVTMDMTTGDIPTSAATAYPLCAEPQ 83 5 

LWGEWGSLATYDSAEWPSGNYWTKKTSTDFVTMDMTTG IPTSAATAYPLCAEPQ 
Sbjct: 7 81 LWGEWGSLATYDSAEWPSGNYWTKKTSTDFVTMDMTTGAIPTSAATAYPLCAEPQ 83 5 



sp PI 192 2 Invasin [Yersinia pseudotuberculosis] 985 AA 

INVA_YERPS align 

Score = 726 bits (1874), Expect = 0.0 

Identities = 381/695 (54%), Positives = 481/695 (68%), Gaps = 43/695 (6%) 

Query: 4 5 NPAEHNEAFNKIISTGTSLAVSGNASNITRSMVNDAAN^ 104 

N + EA NK+ISTG LA SG AS + + SMV DA NQE+K WLNRFGT QVN+NFDK 
Sbjct: 102 NKEQETEAWKMISTGARLAASGRASDVAHSMVGDAVNQEIKQWLNRFGTAQVNLNFDKN 161 

Query: 105 FSLKESSLDWLLPWYDSASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQSWMYGFNTSYDN 164 

FSLKESSLDWL PWYDSAS++FFSQLGIRNKDSRNTLN+G G+RT + W+YG NT YDN 
Sbjct: 162 FSLKESSLDWLAPWYDSASFLFFSQLGIRNKDSRNTLNLGVGIRTLENGWLYGLNTFYDN 221 

Query: 165 DMTGHNHRIGVGAEAWTDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPA 224 

D+TGHNHRIG+GAEAWTDYLQL+ANGYFRLNGWH SRDF-fDY ERPA-fGGD-f AYLPA 
Sbjct: 222 DLTGHNHRIGLGAEAWTDYLQLAANGYFRLNGWHSSRDFSDYKERPATGGDLRANAYLPA 2 81 

Query: 22 5 LPQLGGKLKYEQYRGERVALFGKDNLQSNPYAVTTGliIYTPIPFITLGVDQRMGKSRQHE 2 84 

LPQLGGKL YEQY GERVALFGKDNLQ NPYAVT G+ YTP + P +T+GVDQRMGKS +HE 
Sbjct: 282 LPQLGGKLMYEQYTGERVALFGKDNLQRNPYAVTAGINYTPVPLLTVGVDQRMGKSSKHE 341 

Query: 2 85 IQWNLQMDYRLGESFRSQFSPAWAGTRLLAESRYNLVERNPNIVLEYQKQNTIKLAFSP 344 

QWNLQM+YRLGESF+SQ SP+ VAGTRLLAESRYNLV+RN NIVLEYQKQ +KL SP 
Sbjct: 342 TQWNLQMNYRLGESFQSQLSPSAVAGTRLLAESRYNLVD^ 401 

Query: 345 AVLSGLPGQVYSVSAQIQSQSALQRILWNDAQWVAAGGKLIPVSATDYNWLPPYKPMAP 404 

A +SGLPGQVY V+AQ+Q SA++ I+W+DA+ +AAGG L P+S T +M+VLPPYK A 
Sbjct: 402 ATISGLPGQVYQWAQVQGASAVREIVWSDAELIAAGGTLTPLSTTQFNLVLPPYKRTAQ 461 

Query: 4 05 ASRTVGKTGESEAAVNTYTLSATAIDNHGNSSNPATLTVIVQQPQFVITSEVTDDGALAD 4 64 

SR 4- N Y+LSA A+D+ GN SN TL+V VQQPQ +T+ V DGA A+ 

Sbjct: 462 VSRVT DDLTANFYSLSALAVDHQGNRSNSFTLSVTVQQPQLTLTAAVIGDGAPAN 516 

Query: 4 65 GRTPITVKFTVTNIDSTPVAEQEGVITTSNGALPSKVTKKTDAQGVISIALTSFTVGVSV 524 

G+T ITV+FTV + + P+A QE VITT+NGALP+K+T+KTDA GV IALT+ T GV+V 
Sbjct: 517 GKTAITVEFTVADFEGKPLAGQEWITTNNGALPNKITEKTDANGVARIALTNTTDGVTV 576 

Query: 525 VTLD I QGQQATVD VRFAVLP PD VTNS S FNVS P SD I VADGSMQS I LTF VPRNKNNEF VS G I 584 

VT +++GQ+ +VD F S + P+ I+ADG M S fT ++ + +G 

Sbjct: 577 VTAEVEGQRQSVDTHFVKGTI AADKSTLAAVPTS I IADGLMASTTTLELKDTYGDPQAGA 636 
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Query: 585 T DLE F I QS G VPVTI S P VTENAD - NYTAS WGNS VGD VD I TPQ VGGE S LDLLQKR I TL 640 

D GV . +T++ D Y+A + ++G +T +V G + 
Sbjct: 637 NVAFDTTLGNMGV 1 TDHNDGT YS APLTS TTLG VAT VT VKVD GAA 68 0 

Query: 641 YPVPKITGINWGEQFATDKGFPKTTFNKATFQLVMOTDVANNTQYDWTSSYAASAPVDN 70 0 

+ VP +T VN F D P 4-4-+F + D +A4- T SS 4- PVD 

Sbjct: 681 FSVPSVT VN FTAD PI PDAGRS S FTVS TPD I L ADGTM SSTLSFVPVDK 727 

Query: 701 QGKV N I A YKT YGS T VT VT AKS KKF P S YT AT 730 

G +4-+ G V+ + + 4-4-4- SYTAT 

Sbjct: 728 NGHFISGMQGLSFTQNGVPVSISPITEQPDSYTAT 762 



Score = 386 bits (991) , Expect = e-105 

Identities = 205/376 (54%), Positives = 262/376 (69%), Gaps = 10/376 (2%) 

Query: 462 LADGRTPITVKFTVTNIDSTPVAEQEGVITTSNGALPSKVTKKTDAQGVISIALTSFTVG 521 

+ADG T+ 4-+ PA T+G+ 4-T DGS LTS T+G 

Sbjct: 612 I ADGLMAST I TLELKDTYGDPQAGANVAFDTTLGNM - GVI TDHND - - GTYS APLTS TTLG 668 

Query: 522 VSWTLDIQGQQ ATVDVRFAVLP - PDVTNSSFNVSPSDIVADGSMQS ILTFVPRNKN 577 

V4- VT+ 4- G 4-V V F P PD SSF VS DI+ADG+M S L+FVP 4-KN 

Sbjct: 669 VATVTVKVDGAAFS VPS VTVNFTADP I PDAGRS S FTVS TPD I LADGTMS STLS FVPVDKN 728 

Query: 578 NE F VS G I TDLEF I QS GVP VT I S PVTENADNYTAS WGNS VGD VD I TPQVGGE S LDLLQKR 637 

F+SG+ L F Q4-GVPV4-ISP+TE D+YTA+WGNSVGDV ITPQV L LQK+ 
Sbjct: 729 GHFISGMQGLSFTQNGVPVSISPITEQPDSYTATWGNSVGDVTITPQVDTLILSTLQKK 788 

Query: 63 8 ITLYPVPKITGINVNGEQFATDKGFPKTTFNKATFQLVMNDDVANNTQYDWTSSYAASAP 697 

I+L+PVP 4-TGI VNG+ FATDKGFPKT F ATFQL M+4-DVANNTQY4-W+SS4- + 
Sbjct: 789 I S L F P VPTL TG I L VNGQNFATD KG F P KT I F KNATF QLQMDND VANNTQ YE WS S S F T PNVS 848 

Query: 698 VT)NQGKVNIAYKTYGSTVTVTAKSKKFPSYTATYQFKPNLWVFSGTMSLQSSVEASRNCQ 757 

V4-4-QG4-V I Y4-TY S V VTAKSKKFPSY+ 4-Y4-F PN W + + G SL SS+EASR CQ 
Sbjct: 849 VNDQGQVT I TYQTY - S E VAVTAKS KKFPS YS VS YRF YPNRW I YDGGRSL VS S LEAS RQCQ 907 

Query: 758 RTDFTALIESARASNGSRSPDGTLWGEWGSLATYDSAEWPSGNYWTKKTSTDFVTMDMTT 817 

+D 4-A+4-ES4-RA+NG4-R+PDGTLWGEWGSL Y S + 4-W SG YW KKTSTDF TM4-M T 
Sbjct: 908 GSDMSAVLESSRATNGTRAPDGTLWGEWGSLTAY - SSDWQSGEYWVKKTSTDFETMNMDT 966 

Query: 818 GD I - PTS AATAYPLCA 832 

G 4- P A A+PLCA 
Sbjct: 967 GALQPGPAYLAFPLCA 982 



tr Q56937 Invasin [inv] [Yersinia pestis] 987 AA 

align 

Score = 721 bits (1861), Expect = 0.0 

Identities = 381/696 (54%), Positives = 480/696 (68%), Gaps = 44/696 (6%) 

Query: 45 NPAEHNEAFNKIISTGTSLAVSGNASNITRSMWDAANQEVKHWLNRFGTTQWVNFDKK 104 

N 4- EA NK4-ISTG LA SG AS + 4- SMV DA NQE+K WLNRFGT QVN+NFDK 
Sbjct: 103 NKEQETEAWKMISTGARLAASGRASDVAHSMVGDAVNQEIKQWLNRFGTAQVNLNFDKN 162 

Query: 105 FSLKESSLDWLLPWYDSASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQSWMYGFNTSYDN 164 
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FSLKESSLDWL PWYDSAS-f +FFSQLGIRNKDSRNTLN+G G+RT + W+YG NT YDN 
Sbjct: 163 FSLKESSLDWLAPWYDSASFLFFSQLGIRNKDSRNTLNLGVGIRTLENGWLYGLNTFYDN 222 

Query: 165 DMTGHNHRIGVGAEAWTDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPA 224 

D+TGHNHRIG+GAEAWTDYLQL+ANGYFRLNGWH SRDF+DY ERPA+GGD+ AYLPA 
Sbjct: 223 DLTGHNHRIGLGAEAWTDYLQLAANGYFRLNGWHSSRDFSDYKERPATGGDLRANAYLPA 282 

Query: 225 LPQLGGKLKYEQYRGERVA-LFGKDNLQSNPYAVTTGLIYTPIPFITLGVDQRMGKSRQH 283 

LPQLGGKL YEQY GERVA LFGKDNLQ NPYAVT G+ YTP+P +T+GVDQRMGKS +H 
Sbjct: 2 83 LPQLGGKLMYEQYTGERVALLFGKDNLQRNPYAVTAGINYTPVPLLTVGVDQRMGKSSKH 342 

Query: 2 84 EIQWNLQMDYRLGESFRSQFSPAWAGTRLLAESRYNLVERNPNIVLEYQKQNTIKLAFS 343 

E QWNLQM+YRLGESF-hSQ SP+ VAGTRL L AE S R YNLV+ RN NIVLEYQKQ +KL S 
Sbjct: 343 ETQWNLQMNYRLGESFQSQLSPSAVAGTRLLAESRYNLVDRLJNNIVLEYQKQQVVKLTLS 402 

Query: 344 PAVL SGL PGQVYS VS AQ I Q S QS ALQR I LWNDAQWVAAGGKL I P VS ATD YNWL P P YKPMA 403 

PA +SGLPGQVY V+AQ+Q SA+ + I+W+DA+ +AAGG L P+S T +N+VLPPYK A 
Sbjct: 403 PATISGLPGQVYQVNAQVQGASAVREIWSDAELIAAGGTLTPLSTTQFNLVLPPYKRTA 462 

Query: 4 04 PASRTVGKTGESEAAVNTYTLSATAIDNHGNSSNPATLTVIVQQPQFVITSEVTDDGALA 4 63 

SR + N Y+LSA A+D+ GN SN TL+V VQQPQ +T+ V DGA A 

Sbjct: 463 QVSRVT DDLTANFYSLSALAVDHQGNRSNSFTLSVTVQQPQLTLTAAVIGDGAPA 517 

Query: 464 DGRTPITVKFTVTNIDSTPVAEQEGVITTSNGALPSKVTKKTDAQGVISIALTSFTVGVS 523 

G+T ITV+FTV + + P+A QE VITT+NGALP+K+T+KTDA GV IALT+ T GV+ 
Sbjct: 518 S GKTAI TVE FT VAD FEG KPLAGQE W I TTNNGAL PNKI TE KTD ANGVAR I ALTMTTDGVT 57 7 

Query: 524 WTLDI QGQQATVDVRFAVLPPD VTNS S FNVS PSD I VADGSMQS I LTFVPRNKNNE FVSG 583 

WT +++GQ+ -fVD F S+ P+ I+ADG M S +T + +G 

Sbjct: 578 WTAEVEGQRQSVDTHFVKGTIAADKSTLAAVPTSIIADGLMASTITLELKDTYGDPQAG 637 

Query: 584 IT DLEF I QSGVPVTI S PVTENAD - NYTAS WGNS VGDVD I TPQVGGES LDLLQKR I T 639 

D. GV +T+-f D Y+A + ++G +T +V G + 
Sbjct: 638 ANVAFDTTLGNMGV ITDHNDGTYSAPLTSTTLGVATVTVKVDGAA 682 

Query: 64 0 LYPVPKITGINVNGEQFATDKGFPKTTFNKATFQLVMNDDVANNTQYDWTSSYAASAPVD 699 

+ VP +T VN F D P +++F + D +A+ T SS + PVD 

Sbjct: 683 -FSVPSVT VN FTAD PIPDAGRSSFTVSTPDILADGTM SSTLSFVPVD 728 

Query: 700 NQGKV N I AYKT YG S T VT VT AKS KKF P S YT AT 730 

G +++ G V+++ +++ SYTAT 

Sbjct: 729 KNGHF I S GMQGL S FTQNGVP VS ISPITEQPDS YTAT 764 



Score = 380 bits (975) , Expect = e-104 

Identities = 201/376 (53%) , Positives = 260/376 (68%) , Gaps = 10/376 (2%) 

Query: 462 L ADGRT P I T VKFT VTN I D S T P VAE QE G V I TT SNGAL P S KVTKKTD AQG V I S I AL T S FT VG 521 

+ADG T+ ++ PA T+G+ +T DGS LTS T+G 

Sbjct: 614 I ADGLMAST I TLELKDT YGDPQAGANVAFDTTLGNM - GVI TDHND - - GT YS APLTS TTLG 670 

Query: 522 VSWTLDIQGQQ ATVDVRFAVLP-PDVTNSS FNVS PSD I VADGSMQS ILTFVPRNKN 577 

V+ VT+ + G +V V F P PD SSF VS DI+ADG+M S L+FVP +KN 

Sbjct: 671 VATVTVKVDGAAFS VPS VTVNFTADP I PDAGRSS FTVSTPD ILADGTMS STLS FVPVDKN 730 

Query: 57 8 NE F VS G I TDLE F I QSGVPVT I S PVTENADNYTAS WGNS VGDVD I TPQVGGE S LDLLQKR 63 7 

F+SG+ L F Q+GVPV+ISP+TE D+YTA+WGN+ GDV ITP V L LQK+ 
Sbjct: 731 GHF I SGMQGLS FTQNGVP VS I S P I TEQPDS YTAT WGNTAGDVTITPLVDTL I LSTLQKK 790 
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Query: 63 8 ITLYPVPKITGINVNGEQFATDKGFPKTTFNKATFQLVMNDDVANNTQYDWTSSYAASAP 697 

I+L+PVP +TGI WG+ FATDKGFPKT F ATFQL M+ +DVANHTQY+W+SS+ + 
Sbjct: 791 I S L F P VPTLTG I L VNGQNFATDKGF PKT I FKNATFQLQMDND VANNTQ YE WS S S FTPNVS 850 

Query: 698 VDNQGKVNIAYKTYGSTVTVTAKSKKFPSYTATYQFKPNLWVFSGTMSLQSSVEASRNCQ 757 

V++QG+V I Y+TY S V VTAKSKKFPSY+ +Y+F PN W++ G SL SS+EASR CQ 
Sbjct: 851 VNDQGQ VT I TYQT Y - S E VAVTAKS KKF PS YS VS YRF YPNRW I YDGGTS LVS S I EAS RQCQ 909 

Query: 75 8 RTDFTAL I ES ARASNGS RS PDGTL WGEWGS L ATYD SAEWP SGNYWTKKTS TD FVTMDMTT 817 

+D +A+H-ES+RA+NG+R+PDGTLWGEWGSL Y S + +W SG YW K+TSTDF TM+M T 
Sbjct: 910 GSDMSAVLESSRATNGTRAPDGTLWGEWGSLTAY-SSDWQSGEYWVKRTSTDFETMNMNT 968 

Query: 818 GDI -PTSAATAYPLCA 832 

G + P A A+PLCA 
Sbjct: 969 GLLQPGPAYLAFPLCA 984 



tr Q8ZA73 Putative invasin (Hypothetical) [YP03944] [Yersinia pestis] 3013 AA 

align 

Score = 428 bits (1100), Expect = e-118 

Identities = 265/703 (37%), Positives = 382/703 (53%), Gaps = 38/703 (5%) 

Query: 66 SGNASNITRSMWDAANQEVKHWLNRFGTTQVISrWFDKKFSLKESSLDWLLPWYDSASYV 125 

S ASN+ RS V + N + WLN+FGT +V +N D F L S + LD L+P DS S + 

Sbjct: 178 SDAASNMARSAVTNEINASSQQWLNQFGTARVQLNV1DSDFKLDNSALDLLVPLKDSESSL 237 

Query: 12 6 FFSQLGIRNKDSRNTLNIGAGVRTFQQSWMYGFNTSYDNDMTGHNHRIGVGAEAWTDYLQ 185 

F+QLG+RNKDSRNT+NIGAG+R +Q WMYG NT +DND+TG N R+GVGAE TDYL-f 

Sbjct: 23 8 LFTQLGVRNKDSRNTVNIGAGIRQYQGDWMYGANTFFDNDLTGKNRRVGVGAEVATDYLK 2 97 

Query: 186 LSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPALPQLGGKLKYEQYRGERVALF 245 

SAM YF L GWHQSRDF+ Y+ERPA G DI +AYLPA PQLGGKL YE+YRG+ VALF 

Sbjct: 2 98 FSANTYFGLTGWHQSRDFSSYDERPADGFDIRTEAYLPAYPQLGGKLMYEKYRGDEVALF 3 57 

Query: 246 GKDNLQSNP YAVTTGL I YTP I PF I TLG VDQRMGKS RQHE I QWNLQMD YRLGE S FRS QFS P 3 05 

GKD+ Q +P+AVT G+ YTP+P +T+G + R GK + N+Q++YR+G+ + Q 

Sbjct: 358 GKDDRQKDPHAVTLGWYTPVPLVTIGAEHREGKGNNNNTSVWQLNYRMGQPWNDQIDQ 417 

Query: 306 AWAGTRLLAESRYNLVERNPNIVLEYQKQNTIKLAFSPAVLSGLPGQVYSVSAQIQSQS 365 

+ VA R LA SRY+LVERN NIVL-fY+KQ I L P +SG G +++AQ4- +++ 

Sbjct: 418 SAVAANRTLAGSRYDLVERNNNIVLDYKKQELIHLVL-PDRISGSGGGAITLTAQVRAKY 476 

Query: 366 ALQR I L WND AQWVAAGGKL I P VS ATD YNWLP P YKPMAPASRT VGKTGES E AAVNT YTL S 425 

RI W+ AGG P++- + +V LP Y+ + S NT+T+S 

Sbjct: 477 GFSRIEWDATPLENAGGSTSPLTQSSLSVTLPFYQHILRTS NTHTIS 523 

Query: 426 ATAIDNHGNSSNPATLTVIVQQPQFVITSEV- -TDDGALADGRTPITVKFTVTNIDSTPV 483 

A A D GN+SN A ++ V +P+ ++ S + T D A A+G TV+ TVT+ D P+ 

Sbjct: 524 AVAYDAQGNASNRAVTS IEVTRPETMVI SHLATTIDNATANGI ATNTVQATVTDGDGQP I 583 

Query : 484 AEQEGViTTSNGALPSKVTKKTDAQGVISIALTSFTVGVSWTLDIQGQQATVDVRFAV- 542 

Q +AS +TAGSLT GVS V+ + + +VD F 

Sbjct : 584 I GQL INFAVNTQATL S TTE ARTGANGTAS TTLTHT VS GVS RVS VTLGS S S RS VDTTF VAD 643 
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Query: 543 -LPPDVTNSSFNVSPSDIVADGSMQSILTFVPRNKNNEFVSGITDLEFIQSGVPVTISPV 601 

+ +T 44- V4 +D VA+GS 44-4 + 4- + +G V V 

Sbjct: 644 ESTAEITAANLTVTTNDSVANGSDTNWRAKVTDAYTNAVANQSVIFSASNGATVIDQTV 703 

Query: 602 TENADNYTASWGNSVGDVD-ITPQVGGESLDLLQKRITLYPVPKITGINVNGEQFATDK 660 

NA+ S + N+ V +T +GG+S Q TP V A 

Sbjct: 704 ITNAEGIADSTLTNTTAGVSWTATLGGQS QQVDTTFKPGSTAAISLVKLADRAVAD 760 

Query: 661 G F P KTTFNKAT FQL VMNDD VAN NTQYDWTSSYAASAP - VDNQGKVNIAY K 709 

G + + Q+V+ DN + Q D 4 AS P G +N + + 

Sbjct: 761 G IDQNE IQWLRDGTGNAVPNVPMS IQADNGAI WASTPNTGVDGTINATFTNLR 815 

Query: 710 TYGSTVTVTAKSKKFPSYTATYQFKPNLWVFSGTMSLQSSVEA 752 

S V+VT+ + + T T+ P V S ++ ++ +A 
Sbjct: 816 AGE S WS VT S P ALiVGMTMTMTF SAD PRTAWS TL AA I DNNAKA 858 

Score = 122 bits (305) , Expect = 4e-26 

Identities = 123/477 (25%) , Positives = 199/477 (40%) , Gaps = 28/477 (5%) 

Query: 3 74 DAQWVAAGGKLIPVSATDYNWLPPYKPMAPASRTVGKTGESEAAVNTYTLSATAIDNHG 43 3 

DA GG-f + SAT+- 4- 4+ +V + S A 4- 4+ T + N4 

Sbjct: 2548 DANQNPVGGQQVAFSATNEVTLTESNGSISTPEGSVLLSVTSTQA-GVHPITGTLVSNNY 2606 

Query: 434 MSSNPATLTVIVQQPQFVITSEVTDDGALADGRTPITVKFTVTNIDSTPVAEQEGVITTS 4 93 

+ A Q + T V D+ ALADG T V+ V + VA+ T + 

Sbjct: 2607 TDTFGAAF I ANKNTAQ - L S TLMVVDNNAL ADGVTRNQ VRAHVVD S TGNS VADMAVTF TAN 2665 

Query: 4 94 NGALPSKVTKKTDAQGVISIALTSFTVGVSWT- -LDIQGQQATVDVRFAVLPPDVTNSS 551 

GA SKVT TD G LT+- VGV+WT L G TVD F P + + 

Sbjct: 2666 RGAQLSKVTVLTDNNGDAVNTLTNSLVGVTWTAKLGTAGTPLTVDTVFTAGP- -LATLT 2723 

Query: 552 FNVSPSDIVADGSMQSILTFVPRNKNNEFVSGITDLEFIQSGVPVTISP-VTENADNYTA 610 

+ ++ AD S + + + + G +G +T + NA+ 

Sbjct: 2724 LVTTVNNAFADNSATNTVQATLKDVSGNPIVGEWAFAASNGATITATDGGVSNANGIVL 2783 

Query : 611 SWGNSVGDVDITPQVGGESLDLLQKRITLYPVPKITGINVNGEQFATDKGFPKTTFNKA 670 

+ +N V V L+T+K-f VNG F D GFP T F A 

Sbjct: 2784 ATLTNGTAGVS TVTATIETLTETTDTTFIAMKNLDVTVNGTTFNGDAGFPTTGFVGA 2840 

Query: 671 TFQLVMNDDVANNTQYDWTSSYAASAPVDNQGKV- -NIAYKTYGSTVTVTAKSKKFPSYT 72 8 

TF++ D N+ YDW+SS A V G V N + T T+T++A K S 
Sbjct: 2841 TFKVNSGGD NS L YDWS S S APALVS VS GDG WTFNAVF PTGTPT I T I S ATP KGGGS - P 2896 

Query: 72 9 ATYQFKPNLWVFSGTMSLQSSVEASRNCQRTDF TAL I E S ARASNGSRS PDGTLWG 783 

+Y F+ N W + + + +A 4C+ 4 T 4 4A 4G R4 G LW 

Sbjct: 2897 LSYSFRWQWFINNNGATLNRADAITHCENVGYTMPTSTQVTNAATWMSGKRAV-GNLWS 2955 

Query: 7 84 EWGSLATYDSAEW - PSGNYWTKKT - - STDFVTMDMTTGDI PTSA ATAYPLCAEP 834 

EWG 4 Y 4 W P4 4W 44- + 4 4-4TG 4 T A 4- +C P 

Sbjct: 2 956 EWGDFSAYTAPGWVPAEFFWLSNNHDASTALAIGLSTGTLTTMGDFMAITHVMCTRP 3 012 
Score = 60.1 bits (144), Expect = 2e-07 

Identities = 54/203 (26%), Positives = 86/203 (41%), Gaps = 7/203 (3%) 
Query: 418 AVNTYTLSATAIDNH- -GNSSNPATLTVIVQQPQFVITSE VTDDGALADGRTP I TVK 472 
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A T+TL+ GN T I + I + +T + ALADG + 

Sbjct: 1983 ATLTHTLAGTSWTARVGNRVQSKDTTFIADRTTATIRASDLTITRNNALADGVATNAAR 2042 

Query: 473 FTVTNIDSTPVAEQEGVITTSNGALPSKVTKKTDAQGVISIALTSFTVGVSWTLDI- -Q 530 

VT+ + PV T+ NGAL + + TD+ G S T T G+S VT I 

Sbjct: 2043 VIVTDANGNPVPSMFVGYTSDNGALLTPTSGMTDSSGTFSTTFTHTTAGISKVTAAIVTM 2102 

Query : 531 GQQATVD VRFAVLP PDVTNS S FNVS P SD I VADGSMQS I LTF VPRNKNNE F VSG I TDLE F I 590 

G T D F S V +D +A+ S + + I+ + + + V+G+ 

Sbjct: 2103 GISQTKDAVFIADRSTAHVSELIWKNDSLAN^ 2162 

Query: 591 QSGVPVTISPVTENADNYTASW 613 

V +T + VT N+ Y + + 
Sbjct: 2163 TENVTLTANTVTTNS QG YAENTL 2185 



Score =53.5 bits (127), Expect = 2e-05 

Identities =62/250 (24%), Positives = 93/250 (36%), Gaps = 10/250 (4%) 

Query : 3 73 NDAQWVAAGGKL I P VS ATD YNWL PP YKPMAPAS RT VGKTGE S E AAVNT YTLS ATAI DNH 432 

NQV+ PVTW A + +GTG ATT+ N 

Sbjct : 962 NSVQAWSDSGGNPV- -TGATVVFSSTNATAQVTTVIGTTGVDGIATATLTNTVAGTSNV 1019 

Query: 433 G NSSNPATLTVIVQQPQFVITSEVTDDGALADGRTPITVKFTVTNIDSTPVAEQEGV 489 

+ + N TV IT +GA+ADG V V + + P+ V 

Sbjct: 102 0 VAT I DTVNAN I DTAFVAGAVAT I TLTAP VNGAVADGAD TNQ VDALVEDANGNP I TGAAW 1079 

Query: 4 90 ITTSNGALPSKVTKKTDAQGVISIALTSFTVGVSWTLDIQGQQATVDVRFAVLPPDVTN 54 9 

+ + +NGA TT GVSLT GSV + A +D F + 1 V 

Sbjct: 1080 FSSANGATILSSTMNTG vlSTGVASTLLTHTVAGTSlNlVVATVDTvTSIANIDTTF - - VAGAVAT 1137 

Query: 550 SSFNVSPSDIVADGSMQSILTFVPRNKNNEFVSGITDLEFIQSGVPVTISPV- -TENADN 607 

+ + VADG+ + + V + + V+G + F + 1+ V T AD 

Sbjct: 113 8 I TLTTP VNGAVADGANSNS VQAWS DSDGNP VTGAA - WFS S ANATAQ I TT V I GTTGADG 1196 

Query: 60 8 YTASWGNSV 617 

+ + N+V 
Sbjct: 1197 IATATLTNTV 12 06 



Score = 51.6 bits (122), Expect = 6e-05 

Identities = 63/269 (23%), Positives = 108/269 (39%), Gaps = 30/269 (11%) 

Query : 45 8 DDGALADGRTPITVKFTVTNIDSTPVAEQEGVITTSNGALPSKVTKKTDAQGVIS IALTS 517 

+DGA+ADG V+V++ V V+MG + +TDA G + + + + 

Sbjct: 2226 NDGAVADGIQTNRVEARVYDVSDNLVPNSNWFSADNGGQLVQNDVQTDALGSAYVTVSN 2285 

Query: 518 FTVGVSWTLDIQGQQATVDVRFAVLPPDVT - -NSS FNVS PSD I VADGSMQS I LTF VPRN 575 

GV+ VT+ G A+ F T F ++ + VA+G +++ + + 

Sbjct: 2286 INTGVTKVTVTADGVSASTTTTFIADRDTATLVTDRFLITHDNAVANGWENRVLLHLVD 2345 

Query: 576 KNNE F VS G I TDLE F I QSGVP VT I S PVTENADN YTASWGNSV - GD VD I TPQ VGGE SLDLL 634 

N+ VSG-h +G ■ + S +T+ + + V+ N++ G D+T 
Sbjct: 2346 ANDNSVSGVEVNFSATNGASINASAITD - INGFAIGVLTNTLSGPSDVT 2393 

Query: 635 QKRITLYPVPKITGINWGEQFATDKGFPKTTFNKATFQLVKNDDVANNTQYDWTSSYAA 694 

+TL + V QF D T N AT V+ DD A D + + 

Sbjct: 2394 VTLVTPGGTESLTVT - PQF I AD INTANIATGDFVIIDDGAVANSVD- -ANEVR 2443 
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Query: 695 SAPVDNQGKVNIAYKTY GSTVTVT 718 

+ DNQG Y G+T+T + 

Sbjct: 2444 ARVTDNQGNAI AGYS VVFSS QNGATI TTS 2472 



Score = 48.5 bits (114), Expect = 5e-04 

Identities = 72/292 (24%) , Positives = 101/292 (33%) , Gaps = 39/292 (13%) 

Query : 3 73 NDAQWVAAGGKLIPVSATDYNWLPPYKPMAPASRTVGKTGESEAAVNTYTLSATAIDNH 43 2 

NQV+ PVTW A + +G TG A T T + N 

Sbjct: 1155 N S VQAWS D S DGNP V - - TG AAWF S S ANAT AQ I TTVI GTTGADG I AT ATLTNT VAGTS NV 1212 

Query: 433 G NSSNPATLTVIVQQPQFVITSEVTDDGALADGRTPITVKFTVTNIDSTPVAEQEGV 489 

+ + N TV IT +GA+ADG V V + + + V 

Sbjct: 1213 VAT I DT VNAN IDTAFVAGA VAT I TLTAP VNGAVADGADTNQ VDAL VQD ANGNA I TGAAW 1272 

Query: 490 ITTSNGALPSKVTKKTDAQGVISIALTSFTVGVSWTLDIQGQQATVDVRFAVLPPDVTN 549 

+++NGA TT GVSLT GSV I A+DF + V 

Sbjct: 1273 FSSANGADI IAPTMNTGVNGVASTLLTHTVAGTSNVVATIDTI SANIDTAF - - VAGAVAT 1330 

Query: 550 SSFNVSPSDIVADGSMQSILTFVPRNKNNEFVSG 583 

+ + VADG-f + + + + N + +"G 

Sbjct: 1331 I TLTAP VNGAVADGADTNQVDALVEDANGNP I TGAAWFS S ANGAT I LS S TMNTGVNGVA 1390 

Query: 584 1 TDLEF I QS GVP VT I S P VTENADNYTAS WGNS VGD VD I TPQ VGGE S LD 632 

+T S V TI VTEN D TA V G +V + "+T V G D 

Sbjct: 1391 STFLTHTVAGTSNWATIGSVTENID - -TAFVAG-AVATITLTAPVNGAVAD 143 9 



Score = 47.0 bits (110), Expect = 0.001 

Identities = 46/178 (25%), Positives = 69/178 (37%), Gaps = 5/178 (2%) 

Query: 442 TVIVQQPQFVITSEVTDDGALADGRTPITVKFTVTNIDSTPVAEQEGVITTSNGALPSKV 501 

TV IT +GA+ADG V V + + P+ V +++NGA 

Sbjct: 1321 TAF VAGAVAT I TLTAP VTSFGAVADGADTNQ VT)ALVEDANGNP I TGAAWFS S ANGAT I LS S 13 80 

Query: 502 TKKTDAQGVI S I ALTSFTVGVS WTLD IQGQQATVDVRFAVLPPDVTNS S FNVS PSD I VA 561 

TTGVSLT GSV I +DF+V+ + VA 

Sbjct: 1381 TMNTGVNGVASTFLTHTVAGTSNWATIGSVTENIDTAF - - VAGAVAT I TLTAP VNGAVA 1438 

Query: 562 DGSMQSILTFVPRNKNNEFVSGITDLEFIQSGVPVTISPV- -TENADNYTASWGNSV 617 

DG + + V + + V+G T+F+ I+VTAD ++ N+V 

Sbjct: 143 9 DGVNTNSVQAWSDSDGNAVTGAT-WFSSANATAQITTVIGTTGADGIATATLTNTV 1495 

Score =45.1 bits (105), Expect = 0.006 

Identities = 52/210 (24%), Positives = 90/210 (42%), Gaps = 15/210 (7%) 

Query: 42 5 SATAIDNHGNSSNPATLTVI- -VQQPQFVITSEVTDDGALADGRTPITVKFTVTNIDSTP 4 82 

STI G A+T+V Q +TS + D LA+G+ + VT+ 

Sbjct: 1595 SNTVIAISGAHQGYARVTFVADVSTAQLKLTSFL - -DNQLANGKAGNIAQALVTDAHDNL 1652 

Query: 4 83 VAEQEGVITTSNGALPSKVTKKTDAQGVI SI ALTSFTVGVS W- -TLDIQGQQATVDVRF 54 0 

+A Q NGA+ + A G++ + + G++ V TLD GQ T++ F 

Sbjct: 1653 LANQSVSFALDNGAVIESQGDASSASGIVLMRFNNTLAGMTTVTATLDSTGQTETLETHF 1712 

Query: 541 A VLPPDVTNSSFNVSPSDIVADGSMQSILTFVPRNKNNEFVSGITDLEFIQSGVPVT 597 



http://us.expasy.org/cgi-bin/blast.pl 



9/14/04 



ExPASy BLAST2 Interface 



Page 10 of 13 



++T + N + + I + +Q ++T V N N V +T SG+ +T 

Sbjct: 1713 VAGKAAS I EMTMTKDNAVANNIDTN - EVQVLVTDVDGNAINGAWNLTS NSGMNIT 1767 

Query: 598 I S P VTENAD - NYTAS VVGNS VGD VD I TPQV 626 

+ VT +D TA++ G + I ++ 

Sbjct: 1768 PNSVTTGSDGTATATLTHTLAGSLPINARI 1797 



Score = 43.9 bits (102), Expect = 0.013 

Identities = 34/114 (29%), Positives - 51/114 (43%), Gaps = 4/114 (3%) 

Query: 456 VTDDGALADGRTPITVKFTVTNIDSTPVAEQEGVITTSNGALPSKVTKKTDAQGVISIAL 515 

+ DDGA+A+ V+ VT+ +A V ++ NGA + + T G S L 

Sbjct: 2427 IIDDGAVANSVDANEVRARVTDNQGNAIAGYSWFSSQNGATIT-TSGITGVDGWASAKL 2485 

Query: 516 TSFTVGVSWTLDIQGQQATVDVRFAVLPPDVTNSS FNVSPSDIVADGSMQ 566 

T G S + + ATV DV+ ++ FN +P I+ADG MQ 

Sbjct: 2486 THI KAGESGI LARLSRPMATVHTLMP YF I ADVSTATLQLFNFNP I PI I ADGVMQ 2539 

Score =43.9 bits (102), Expect = 0.013 

Identities = 59/303 (19%), Positives = 120/303 (39%), Gaps = 28/303 (9%) 

Query: 424 LSATAIDNHGNSSNPATLTVIVQQPQFVITSE- -VTDDGALADGRTPITVKFTVTNIDST 481 

+ +AT + + G+S N T + + S+ V D A+AD V VT+ 

Sbjct: 1893 VTATWSS-GSSRNIDTTFIADVTTAHIAASDLMVIVDDAVADNLDKNEVHARVTDAKGN 1951 

Query: 482 PVAEQEGVITTSNGALPSKVTKKTDAQGVI S I ALTS FTVGVS WTLD I QGQQATVDVRFA 541 

++ Q + T+ NGA + V +D G+ LT G SWT + + + D F 
Sbjct: 1952 VLSGQTVIFTSGNGAAITTVNGISDGDGLTKATLTHTLAGTSWTARVGNRVQSKDTTFI 2011 

Query: 542 V- -LPPDVTNSSFNVSPSDIVADGSMQSILTFVPRNKNNEFVSGITDLEFIQSGVPVT- I 598 

+ S ++ ++ +ADG + + + N V + +G +T 

Sbjct: 2012 ADRTTAT I RAS DLT I TRNNAL ADGVATNAARVI VTDANGNP VP SMF VG YTSDNGALLTPT 2071 

Query: 599 SPVTENADNYTASWGNSVGDVDITPQVGGESLDLLQKRITLYPVPKITGINVNGEQFAT 658 

S +T+++ ++++G+T++++ F 
Sbjct: 2072 S GMTDS S GT FS TTFTHTTAG I S KVTAAI VTMG I S QTKDAV FIA 2114 

Query: 659 DKGFPKTTFNKATFQLWINDDVANNTQYDWTSSYAASAPTONQGKVNIAYKTYGS^^ 718 

D ++T + + +V ND +ANN+ + ++ A + 4-N+ + VT+T 

Sbjct: 2115 D----RSTAHVSELIWKNDSLANNSDRNIVQAHIKDAHGNWTGMNWFSA-TENVTLT 2169 

Query: 719 AKS 721 

A + 

Sbjct: 2170 ANT 2172 



Score = 39.3 bits (90), Expect = 0.31 

Identities = 34/158 (21%) , Positives = 64/158 (39%), Gaps = 1/158 (0%) 

Query: 459 DGAL AD GRT P I T VKF T VTN I D S T P VAEQE G V I TT S NG AL P S KVT KKTD AQG V I S I ALT S F 518 

D A+ADG ++ + + V I NGA+ T T G 1+ T+ 

Sbjct: 755 DRAVADGIDQNEIQWLRDGTGNAVPNVPMSIQADNGAIVVASTPNTGVDGTINATFTNL 814 

Query: 519 TVGVSWTLDIQGQQA-TVDVRFAVLPPDVTNSSFNVSPSDIVADGSMQSILTFVPRNKN 577 

G SW++ T+ + F+ P S+ ++ ADG+ + + + + N 

Sbjct: 815 RAGESWSVTSPALVGMTMTMTFSADPRTAWSTLAAIDNNAKADGTDTNWRAWVVDAN 874 
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Query: 578 NE F VS G I TDLE F I Q S GVPVT I S P VTENADN YTAS WGN 615 

V G+ + +G + +PV + + Y + + N 

Sbjct: 875 GNSVPGVSVTFDAGNGAVLAQNPWTDRNGYAENTLTN 912 



tr Q74QZ6 Putative invasin [YP3306] [Yersinia pestis] 3108 AA 

align 



Score = 428 bits (1100), Expect = e-118 

Identities = 265/703 (37%), Positives = 382/703 (53%), Gaps = 38/703 (5%) 

Query : 66 SGNASNITRSMWDAANQEVKHWLNRFGTTO 125 

S ASN+ RS V + N + WLN+FGT +V +N D F L S+LD L+P DS S + 
Sbjct: 178 SDAASNMARSAVTNEINASSQQWLNQFGTARVQLNVDSDFKLDNSALDLLVPLKDSESSL 237 

Query: 126 FFSQLGIRNKDSRNTLNIGAGVRTFQQSWMYGFNTSYDNDMTGHMHRIGVGAEAWTDYLQ 185 

F+QLG+RNKDSRNT+NIGAG-fR +Q WMYG NT +DND+TG M R+GVGAE TDYL+ 
Sbjct: 238 LFTQLGVRNKDSRNTVNIGAGIRQYQGDWMYGANTFFDNDLTGKNRRVGVGAEVATDYLK 297 

Query: 186 LSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPALPQLGGKLKYEQYRGERVALF 245 

SAN YF L GWHQSRDF+ Y+ERPA G DI +AYLPA PQLGGKL YE+YRG+ VALF 
Sbjct: 298 FSANTYFGLTGWHQSRDFSSYDERPADGFDIRTEAYLPAYPQLGGKLMYEKYRGDEVALF 357 

Query: 246 GKDNLQSNP YAVTTGL I YTP I PF ITLGVDQRMGKSRQHE I QWNLQMD YRLGES FRSQFS P 305 

GKD + Q +P+AVT G+ YTP+P +T+G + R GK + N+Q++YR+G+ + Q 
Sbjct: 358 GKDDRQKDPHAVTLGWYTPVPLVTIGAEHREGKGNNNNTSVWQLNYRMGQPWNDQIDQ 417 

Query: 306 AWAGTRLLAE S R YNL VERNPN I VLE YQKQNT I KL AF S P AVLS GLPGQVYS VS AQ I Q S Q S 365 

+ VA R LA SRY+LVERN NIVL+Y+KQ I L P +SG G +++AQ++++ 
Sbjct: 418 SAVAANRTLAGSRYDLVERNNNIVLDYKKQELIHLVL - PDRISGSGGGAITLTAQVRAKY 476 

Query: 3 66 ALQR I LWNDAQW VAAGGKL I P VS ATD YNWL P P YKPMAP AS RT VGKTGE S E AAVNT YTL S 425 

RI W+ AGG P++ + +V LP Y+ 4- S NT+T+S 
Sbjct: 477 GFSRIEWDATPLENAGGSTSPLTQSSLSVTLPFYQHILRTS NTHTIS 523 

Query: 426 ATAIDNHGNSSNPATLTVIVQQPQFVITSEV- -TDDGALADGRTPITVKFTVTNIDSTPV 483 

A A D GN+SN A ++ V +P+ ++ S + T D A A+G TV+ TVT+ D P+ 

Sbjct: 524 AVAYDAQGNASNRAVTSIEVTRPETMVISHLATTIDNATANGIATNTVQATVTDGDGQPI 5 83 

Query: 484 AEQEGVITTSNGALPS KVTKKTDAQGVI S I ALTS FTVGVS WTLD I QGQQATVDVRFAV - 542 

Q + A S +T A G S LT GVS V++ + +VD F 

Sbjct: 584 IGQLINFAVNTQATLSTTEARTGANGTASTTLTHTVSGVSRVSVTLGSSSRSVDTTFVAD 643 

Query: 543 -LPPDVTNSSFNVSPSDIVADGSMQSILTFVPRNKNNEFVSGITDLEFIQSGVPVTISPV 601 

++T + + V+ +D VA+GS + + + + V+ + + +G V V 

Sbjct: 644 E S TAE I TAANLTVTTND S VANGS DTNVVRAKVTDAYTNAVANQ S V I FS AS NGAT V I DQTV 703 

Query: 602 TENADNYTASWGNSVGDVD - ITPQVGGESLDLLQKRITLYPVPKITGINVNGEQFATDK 660 

NA+ S + N+ V +T +GG+S Q T P V A 

Sbjct: 704 ITNAEGIADSTLTNTTAGVSWTATLGGQS QQVDTTFKPGSTAAISLVKLADRAVAD 760 

Query: 661 GFPKTTFNKATFQLVMNDDVAN NTQYDWTSS YAASAP - VDNQGKVNIAY K 709 

G ++ Q+V+ DN + Q D + AS P G +N + + 

Sbjct: 761 G IDQNEIQWLRDGTGNAVPNVPMSIQADNGAIWASTPNTGVDGTINATFTNLR 815 
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Query: 710 TYGSTVTVTAKSKKFPSYTATYQFKPNLWVFSGTMSLQSSVEA 752 

S V+VT+ + + T T+ P V S ++ ++ +A 
Sbjct: 816 AGES WSVTS PAL VGMTMTMTFSADPRTAVVSTLAAIDNNAKA 858 



Score = 122 bits (305) , Expect = 4e-26, 

Identities = 123/477 (25%), Positives = 199/477 (40%), Gaps = 28/477 (5%) 

Query: 3 74 DAQWVAAGGKL I P VS ATD YNWL PP YKPMAP AS RTVGKTGE S E AAVNT YTLS ATA I DNHG 433 

DA GG+ + SAT+ + + + +V + S A + ++ T + N+ 

Sbjct: 2643 DANQNPVGGQQVAFSATNEVTLTESNGSISTPEGSVLLSVTSTQA-GVHPITGTLVSNWY 2701 

Query: 4 34 NSSNPATLTVIVQQPQFVITSEVTDDGALADGRTPITVKFTVTNIDSTPVAEQEGVITTS 4 93 

+ A Q + T V D+ ALADG T V4- V 4- VA+ T + 

Sbjct: 2702 TDTFGAAF I ANKNTAQ - LSTLMWDNNALADGVTRNQVRAHWDSTGNSVADMAVTFTAN 2760 

Query: 494 NGALPS KVTKKTDAQGVI S I ALTS FTVGVS WT - - LD I QGQQATVDVRFAVLPPDVTNS S 551 

GA SKVT TD G LT+ VGV+WT L G TVD F P + + 

Sbjct: 2761 RGAQLSKVTVLTDNNGDAVNTLTNSLVGVTWTAKLGTAGTPLTVDTVFTAGP--LATLT 2 818 

Query: 552 FNVSPSDIVADGSMQSILTFVPRNKNNEFVSGITDLEFIQSGVPVTISP-VTENADNYTA 610 

+ 4-4- AD S 4- + ++ + + G +G +T + NA+ 

Sbjct: 2 819 LVTTVNNAFADNSATNTVQATLKDVSGNPIVGEWAFAASNGATITATDGGVSNANGIVL 2 878 

Query: 611 SWGNSVGDVDITPQVGGESLDLLQKRITLYPVPKITGINVNGEQFATDKGFPKTTFNKA 670 

4- + N V V L+T + K 4- VNG F D GFP T F A 

Sbjct: 2879 ATLTNGTAGVS TVTATIETLTETTDTTFIAMKNLDVTVNGTTFNGDAGFPTTGFVGA 2935 

Query : 671 TFQLVMNDDVANNTQYDWTSSYAASAPVDNQGKV- -NIAYKTYGSTVTVTAKSKKFPSYT 72 8 

TF4- + D N+ YDW+SS A V G V N + T T+T++A K S 
Sbjct: 2936 TFKVNSGGD- - -NSLYDWSSSAPALVSVSGDGWTFNAVFPTGTPTITISATPKGGGS-P 2991 

Query: 72 9 AT YQFKPNLWVF SGTMS LQS S VE ASRNCQRTD F TAL I E S ARASNGSRS PDGTLWG 783 

+Y F+ N W 4- + + +A +C+ + T + +A +G R+ G LW 

Sbjct: 2 992 LSYSFRWQWFINNNGATLNRADAITHCENVGYTMPTSTQVTNAATWMSGKRAV-GNLWS 3 050 

Query : 784 EWGSLATYDSAEW-PSGNYWTKKT- -STDFVTMDMTTGDIPTSA ATAYPLCAEP 834 

EWG + Y + W P+ +W 4-4- + + ++TG 4~ T A + +C P 

Sbjct: 3051 EWGDFSAYTAPGWVPAEFFWLSNNHDASTALAIGLSTGTLTTMGDFMAITHVMCTRP 3107 

Score = 60.1 bits (144), Expect = 2e-07 

Identities = 54/203 (26%) , Positives = 86/203 (41%) , Gaps = 7/203 (3%) 

Query: 418 AVNTYTLSATAIDNH- -GNSSNPATLTVIVQQPQFVITSE VTDDGALADGRTPITVK 4 72 

A T+TL+ T+4- GN T I 4- I + +T 4- ALADG + 

Sbjct: 2078 ATLTHTLAGTSWTARVGNRVQSKDTTFIADRTTATIRASDLTITRNNALADGVATNAAR 2137 

Query: 4 73 FTVTNIDSTPVAEQEGVITTSNGALPS KVTKKTDAQGVI SI ALTS FTVGVS WTLDI- -Q 53 0 

VT+ 4- PV T+ NGAL 4- 4- TD+ G S T T G4-S VT I 

Sbjct: 213 8 VIVTDANGNPVPSMFVGYTSDNGALLTPTSGMTDSSGTFSTTFTHTTAGISKVTAAIVTM 2197 

Query: 531 GQQATVI)VRFAVLPPDVTNSSFNVSPSDIVADGSMQSILTFVPRNKNNEFVSGITDLEFI 5 90 

G T D F S V 4-D 4-A+ S 4-4-1+ 4-4- 4- V4-G4- 

Sbjct: 2198 G I S QTKDAVF I ADRS TAHVSEL I WKNDS LANNSDRNI VQAH I KDAHGNWTGMNVNFS A 2257 

Query: 5 91 QSGVPVTISPVTENADNYTASW 613 

* V +T 4- VT N+ Y 4- 4- 
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Sbjct: 22 5 8 TENVTLTANTVTTNS QG YAENTL 22 80 



Score = 56.6 bits (135), Expect = 2e-06 

Identities = 63/250 (25%), Positives = 93/250 (37%), Gaps = 10/250 (4%) 

Query: 3 73 ND AQ WVAAGG KL I P VS ATD YNWL P P YKPMAP AS RT VGKTGE S E AAVNT YTL S ATA I DNH 432 

NQV+ PV TW A + +G TG ATT+ N 

Sbjct: 1155 NSVQAWSDSDGNPV- -TGAAWFSSANATAQITTVIGTTGADGIATATLTNTVAGTSNV 1212 

Query: 433 G NSSNPATLTVIVQQPQFVITSEVTDDGALADGRTPITVKFTVTNIDSTPVAEQEGV 489 

++ N TV IT +GA+ADG V V + + P+ V 

Sbjct: 1213 VATIDTWANIDTAFVAGAVATITLTAPVNGAVADGADTNQVDALVEDANGNPITGAAVV 12 72 

Query: 4 90 ITTSNGALPSKVTKKTDAQGVISIALTSFTVGVSWTLDIQGQQATVDVRFAVLPPDVTN 54 9 

+++NGA T T GV S LT G S V + A +D F +P V 

Sbjct: 1273 FSSANGATILSSTMNTGVNGVASTLLTHTVAGTSNWATVDTVNANIDTAF- - VPGAVAT 1330 

Query: 550 S S FNVS PSD I VADGSMQS I LTFVPRNKNNEFVSGITDLEF IQSGVPVTI S PV - - TENADN 607 

+ + VADG+ + + V + V+G + F + 1+ V T AD 

Sbjct: 13 31 ITLTTPVNGAVADGANSNSVQAWSDSEGNAVAGAA-WFSSANATAQITTVIGTTGADG 13 8 9 

Query: 60 8 YTASWGNSV 617 

+ + N+V 
Sbjct: 13 90 IATATLTNTV 13 99 
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Search in Swiss-Prot and TrEMBL for: invasin 

Swiss-Prot Release 44.5 of 13-Sep-2004 
TrEMBL Release 27.5 of 13-Sep-2004 



• Number of sequences found in Swis s-P rot^ ^ and TrEMBL^y 56 

• Note that the selected sequences can be saved to a file to be later retrieved; to do so, go to the 
bottom of this page. 

• For more directed searches, you can use the Sequence Retrieval System SRS. 



Search in Swiss-Prot: There are matches to 11 out of 158316 entries 

INVA YEREN 1£196) 

Invasin. -(Yersini^enterocolitica 
I NVA_Y E RPS (ITO22) 

Invasin. - Yersinia pseudotuberculosis 
IPAA SHIFL (P18010) 

Invasin ipaA (70 kDa antigen). 
/IflexnerK 
IPAB-SHIJDY (Q03945) 

Invasin ipaB (62 kDa antigen). 
IPAB SHIFL (P18011) 

Invasin ipaB (62 kDa antigen). 

flexneri 
IPACJ1HIDY (Q03946) 

Invasin ipaC (42 kDa antigen). 
IPACLSHIFL(P18012) 

Invasin ipaC (42 kDa antigen). 

flexneri ( 
IPAD^SHIDY (Q03947) 

Invasin ipaD (37 kDa membrane antigen). {GENE: Name=ipaD} - Shigella dysenteriae 
IPAD^SHIFL (P18013) 

Invasin ipaD (36 kDa membrane antigen). {GENE: Name=ipaD; OrderedLocusNames=CP0126} 

- Shigella flexneri 
YADA_YEREN (P31489) 

Invasin precursor (Outer.membrane adhesin). {GENE: Name=yadA; Synonyms=yopA, invA, 

yopl} - Yersinia enterocolitica) 
YADA^YERPS (P1085 V 8)^^>^ 



{GENE: Name=ipaA; OrderedLocusNames=CP0125} - Stagella 

{GENE: Name=ipaB} - Shigella ^^^t^^) 

{GENE: Name=ipaB; OrderedLocusNames=CP0128} - Shigella 

{GENE: Name=ipaC} - Shigella dysenteriae 

{GENE: Name=ipaC; OrderedLocusNames=CP0127} - Shigella 
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Invasin precursor (Outer membrane adhesin). {GENE: Name=yadA; Synonyms=yopA, invA, 
yopl} - Yersinia pseudotuberculosis 



Search in TrEMBL: There are matches to 45 out of 1400820 entries 

007390 

Invasin 1 {GENE:Name=invl} - Mycobacterium avium 
007391 

Invasin 2 {GENE:Name=inv2} - Mycobacterium avium 
084129 

Predicted polysaccharide hydrolase-invasin repeat family {GENE:Name=ydhO; 
OrderedLocusNames=CT127} - Chlamydia trachomatis 
084606 

Invasin repeat family phosphatase {GENE:Name=papQ; OrderedLocusNames=CT601} - 
Chlamydia trachomatis 
084764 

Muramidase (Invasin repeat family) {GENE:Name=nlpD; OrderedLocusNames=CT759} - 

Chlamydia trachomatis 
047380 * ^ — 

Invasin - Escherichia coli 
056889 W^-^ 

Invasin {GENE:Name=invA} - Yersinia enterocolitica 
056937 

Invasin {GENE:Name=inv} - Yersinia pestis 
Q6T8F2 

Putative invasin-like protein (Fragment) - Chlamydia suis 
06T8F6 

TetR(C)-invasin fusion protein (Fragment) - Chlamydia suis 
06T8F7 

Invasin-like protein (Fragment) - Chlamydia suis 
06T8G9 

Invasin-like protein (Fragment) - Chlamydia suis 
Q74QZ6 

Putative invasin {GENE:OrderedLocusNames=YP3306} - Yersinia pestis 
07A8L6 

Putative invasin {GENE:OrderedLocusNames-ECs5290} - Escherichia coli 0157:H7 
O7AGX0 

Adhesin/invasin-like protein {GENE:OrderedLocusNames=ECs0548} - Escherichia coli 
0157:H7 
O 7AHB0 

Putative invasin {GENE:OrderedLocusNames=ECs0336} - Escherichia coli 0157:H7 
Q7N592 

Similar to putative invasin and adhesin {GENE:OrderedLocusNames=plu2064} - Photorhabdus 
luminescens (subsp. laumondii) 
Q83T27 

Invasin-like protein {GENE:OrderedLocusNames=tl573} - Salmonella typhi 
Q8EFM1 

Invasin domain protein {GENE:OrderedLocusNames=S01949} - Shewanella oneidensis 
Q8FY77 



http://us.expasy.org/cgi-bin/sprot-search-de7invasin 



9/14/04 



Search in Swiss-Prot and TrEMBL for: invasin 



Page 3 of 4 



Intimin/invasin family protein {GENE:OrderedLocusNames=BR2009} - Brucella suis 
Q8KH85 

Putative adhesin/invasin - Neisseria meningitidis 
Q8KHF7 

Putative adhesin/invasin - Neisseria meningitidis 
Q8KHP5 

Putative adhesin/invasin - Neisseria meningitidis 
Q8KI42 

Putative adhesin/invasin - Neisseria meningitidis 
08KUJ1 

Putative adhesin/invasin - Neisseria meningitidis 
Q8KUJ6 

Putative adhesin/invasin - Neisseria meningitidis 
Q8KWG5 

Putative invasin protein {GENE:Name=agg3B} - Escherichia coli [Plasmid pAA-like] 
Q8RBD9 

Soluble lytic murein transglycosylase and related regulatory proteins (Some contain LysM/invasin 
domains) (GENE:Name=MltE2; OrderedLocusNames=TTE0883} - Thermoanaerobacter 
tengcongensis 
Q8RBE4 

Soluble lytic murein transglycosylase and related regulatory proteins (Some contain LysM/invasin 
domains) {GENE:Name=MltE; OrderedLocusNames=TTE0876} - Thermoanaerobacter 
tengcongensis 
Q8RQ63 

Putative adhesin/invasin {GENE:Name=Aa32-l-2} - Actinobacillus actinomycetemcomitans 
(Haemophilus actinomycetemcomitans) 
Q8VM53 

Putative invasin {GENE:Name=pagN} - Salmonella enterica Illb 50:k:z 
Q8XB95 

Putative invasin {GENE:OrderedLocusNames=z5932} - Escherichia coli 0157:H7 
Q8Z795 

Invasin-like protein {GENE:OrderedLocusNames=STY1395} - Salmonella typhi 
Q8Z7G3 

Putative invasin {GENE:OrderedLocusNames=STY1284, tl677} - Salmonella typhi 
Q8ZA73 

Putative invasin (Hypothetical) {GENE :OrderedLocusNames=YP03 944, y3884} - Yersinia pestis 
Q 8ZP33 

Putative invasin {GENE:Name=ychP; OrderedLocusNames=STM1768} - Salmonella 
typhimurium 
08ZP78 

Homology to invasin C of Yersinia; intimin {GENE:OrderedLocusNames=STM1669} - 
Salmonella typhimurium 
O9AM40 

Invasin (Fragment) {GENE:Name=inv} - Yersinia pseudotuberculosis 
Q9JSH2 

Polysaccharide hydrolase-invasin repeat family {GENE:Name=ydhO} - Chlamydia pneumoniae 
(Chlamydophila pneumoniae) 
Q9JXK7 

Adhesin/invasin, putative {GENE:OrderedLocusNames=NMB1994} - Neisseria meningitidis 
(serogroup B) 
O9KK70 
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Putative invasin 996A009 (Fragment) - Mycobacterium avium 
Q9L574 

Invasin-like SivH {GENE:Name=sivH} - Salmonella typhi 
Q9X4L4 

Invasin homolog AafB {GENE:Name=aafB} - Escherichia coli 
O9Z704 

Muramidase (Invasin repeat family) (Hypothetical protein CP0964) (Cell wall hydrolase) 
{GENE:Name=nlpD; Synonyms^lytE; OrderedLocusNames=CP0964, CPn0902, CpB0934} - 
Chlamydia pneumoniae (Chlamydophila pneumoniae) 
Q9Z8T9 

Polysaccharide Hydrolase-Invasin Repeat Family (NLP/P60 family protein) (P60) 
{GENE:Name=ydhO; OrderedLocusNames=CP0517, CPn0245, CpB0252} - Chlamydia 
pneumoniae (Chlamydophila pneumoniae) 



New Search | 

in Swiss-Prot/TrEMBL by AC, ID, description, 
gene name, organism 

Please do NOT use any boolean operators (and, 
or, etc.) 



If you would like to retrieve all the Swiss-Prot/TrEMBL entries contained in this list , you can enter a 
file name. These entries will then be saved to a file under this name in the directory outgoing of the 
ExPASy anonymous ftp server , from where you can download it. (Please note that this temporary file 
will only be kept for 1 week.) 

File name: 1 ■ 

Format: <? Swiss-Prot O Fasta 



Reset | Qr | Create file | 
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Search ExPASv 


Contact us [ Swiss-Prot | 


Search JSwiss-Prot/TrEMBL 


Z.J for |invasin 
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Go 


Clear 





NiceProt 
View of 
TrEMBL: 
Q56889 



Printer-friendly view 



[Entry into] [Name and origin] [References] [Comments] [Cross-reference [Keywords] 

[Features] [Sequence] [Tools] 

Note: most headings are clickable, even if they don't appear as links. They link to the user manual or other documents . 

Entry information 



Q56889 
Q56889 

None 

Release 01, November 1996 
Release 01, November 1996 



Entry name 

Primary accession number 
Secondary accession numbers 
Entered in TrEMBL in 
Sequence was last modified in 
Annotations were last modified in Release 26, March 2004 
Name and origin of the protein 
Protein name 
Synonyms 
Gene name 
From 

Taxonomy 



Invasin 

None 

Name: invA 

Yersinia enterocolitica [TaxID: 630 ] 

Bacteria ; Proteobacteria ; Gammaproteobacteria ; Enterobacteriales : 
Enterobacteriaceae ; Yersinia . 

References 

[1] SEQUENCE FROM NUCLEIC ACID. 
STRAIN=W1024; 

MEDLINE=94 1 95 1 00;PubMed=75 1 1 772 [ NCBL ExPASy, EBI, Israel Japan] 
Pepe J.C . Badger J.L. , Miller V.L. ; 

"Growth phase and low pH affect the thermal regulation of the Yersinia enterocolitica inv gene."; 
MoL Microbiol. 11:123-135(1994). 

[2] SEQUENCE FROM NUCLEIC ACID. 
STRAIN=W1024; 

MEDLINE=98048470;PubMed=9387224 [ NCBL ExPASv , EBL Israel Japan ] 

Fauconnier A. , Allaoui A. , Campos A. , Van Elsen A. , Cornelis G.R. . Bollen A. ; 

"Flagellar flhA, flhB and flhE genes, organized in an operon, cluster upstream from the inv locus in 

Yersinia enterocolitica."; 

Microbiology 143:3461-3471(1997) . 

Comments 

None 

Cross-references 
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EMBL 

PIR 

HSSP 

GO 



InterPro 



Z48169; CAA88188.1; -.[ EMBL / GenBank / DDBJ ] [ CoDingSequence ] 
S54216; S54216. 

PI 1922 : 1CWV. [ HSSP ENTRY / PDB] 

GO:0Q07155 ; Biological process: cell adhesion (inferred from electronic 
annotation). 

QuickGo 
view. 

IPR003344 : Big_l. 
1PR003535 : Intimin. 
IPRQ08964 : Invasin_intimin. 
Graphical view of domain structure . 
PF02369 : Big_l; 1. 

Pfam graphical view of domain structure . 
PRO 1369 : INTIMIN. 

[ Domain structure / List of seq. sharing at least 1 domain ] 
[ Family / Alignment / Tree ] 
056889 . 
056889 . 
056889 . 

Q56889 : C817F6D7766184E3. 
Get region on 2D PAGE . 

View cluster of proteins with at least 50% / 90% identity. 



Pfam 

PRINTS 

ProDom 

HOBACGEN 

ProtoMap 

PRESAGE 

ModBase 

SMR 

SWISS- 
2DPAGE 

UniRef 
Keywords 
None 
Features 
None 

Sequence information 

Length: 835 Molecular weight: 91367 CRC64: C817F6D7766184E3 [This is a checksum on the 
AA Da sequence] 
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LWGEWGSLAT YDSAEWPSGN YWTKKTSTDF VTMDMTTGAI PTSAATAYPL CAEPQ Q56889 in FASTA 

format 

View entry in ori gi nal TrEMBL format 
View entry in raw text format (no links) 
Request for annotation of this TrEMBL entry 



BLAST submission on 
BLAST ExPASy/SIB 

or at NCBI (USA) 



ScanProsite , MotifScan 






Sequence analysis tools: ProtParam , Prot Scale , 
Compute pI/Mw, PeptideMass , PeptideCutten 
Dotlet (Java) 

Search the SWISS-MODEL Repository 



| ExPASy Home page 



Site Map 



Search ExPASy 



Contact us 



Swis s-Prot 



Hosted bv NCSC US 


Mirror sites: 


Australia 


Bolivia 


Canada 


China 


Korea 


Switzerland 


Taiwan 
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[l tk ExPASv Home page 


Site Map 


Search ExPASv 


Contact us 


Swiss-Prot j 




Go 


Clear 




Search |Swiss-Prot/TrEMBL | f or invasin j 



NiceProt 
View of 
Swiss- 
Prot: 
P31489 



Printer-friendly view 



L 



Quick BlastP search 



[Entry info] [Name and origin] [References] [Comments] [Cross-references] [Keywords] 

[ Features ] [ Sequence ] [ Tools ] 

Note: most headings are clickable, even if they don't appear as links. They link to the user manual or other documents . 
Entry information 



Entry name 

Primary accession number 
Secondary accession numbers 
Entered in Swiss-Prot in 
Sequence was last modified in 
Annotations were last modified in 
Name and origin of the protein 
Protein name 
Synonym 
Gene name 



YADA_YEREN 
P31489 

None 

Release 26, July 1993 
Release 26, July 1993 
Release 44, July 2004 



Invasin [Precursor] 
Outer membrane adhesin 

Name: yadA 

Synonyms: yopA, invA, yopl 

From Yersinia enterocolitica [TaxID: 630 ] 

Encoded on Plasmid pYV. 

Taxonomy Bacteria ; Proteobacteria ; Gammaproteobacteria ; Enterobacteriales : 

Enterobacteriaceae ; Yersinia . 

References 

[1] SEQUENCE FROM NUCLEIC ACID. 
STRAIN=6471/76 / Serotype 0:3; 

MEDLINE=95020586;PubMed=7934875 [ NCBL ExPASv . EBI, Israel . Japan ] 

Tamm A. . Tarkkanen A. . Korhonen T.K. . KuuselaP. , Toivanen P. , Skurnik M. ; 

"Hydrophobic domains affect the collagen-binding specificity and surface polymerization as well as 

the virulence potential of the YadA protein of Yersinia enterocolitica."; 

Mol. Microbiol. 10:995-1011(1993). 

Comments 

• FUNCTION: Invasin is a protein that allows enteric bacteria to penetrate cultured mammalian 
cells. The entry of invasin in the cell is mediated by binding several beta-1 chain integrins. This 
protein is an adhesin forming a fibrillar matrix on the cell surface. 

• SUBCELLULAR LOCATION: Outer membrane. 
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EMBL 
PIR 



InterPro 



Copyright 

This Swiss-Prot entry is copyright. It is produced through a collaboration between the Swiss Institute of Bioinformatics and 
the EMBL outstation - the European Bioinformatics Institute. There are no restrictions on its use by non-profit institutions as 
long as its content is in no way modified and this statement is not removed. Usage by and for commercial entities requires a 
license agreement (See http://www.isb-sib.ch/announce/ or send an email to license@.isb-sib.ch ) 

Cross-references 

X13882; CAA32086.1; -.[ EMBL / GenBank / DDBJ ] [ CoDingSequence ] 
S04912; S04912. 
IPR008126 : Adhesion. 
IPR008640 ; Hep_Hag. 
IPR008635 : HIM. 
IPR005594 ; YadA_C. 
Graphical view of domain structure . 

PF05658; Hep Hag; 4. 
PF05662 ;HIM; 1. 
PF03895 : YadA: 1. 

Pfam graphical view of domain structure . 
PR01756 : OMADHESIN. 

[ Domain structure / List of seq. sharing at least 1 domain ] 
[ Family / Alignment / Tree ] 
P31489 . 
P31489 . 
P31489 . 
P31489 . 
P31489 . 
P31489. 



Pfam 



PRINTS 
ProDom 
HOBACGEN 
BLOCKS 
ProtoNet 
ProtoMap 
PRESAGE 
DIP 

ModBase 
SMR 

SWISS-2DPAGE 
UniRef 
Keywords 

Outer membrane : Plasmid : Signal: Virulence . 

Features 



P31489 : AC12EF68C657DAC0. 
Get region on 2D PAGE . 

View cluster of proteins with at least 50% / 90% identity. 



Feature table viewer 



Key From To Length Description 

SIGNAL 1 25 25 

CHAIN 26 455 430 Invasin. 



Sequence information 

Length: 455 AA [This is the 
length of the unprocessed 
precursor] 



Molecular weight: 47136 Da 
[This is the MW of the 
unprocessed precursor] 



CRC64: AC12EF68C657DAC0 [This 
is a checksum on the sequence] 
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P31489 in FASTA 
format 



View entry in original Swiss-Prot format 

View entry in raw text format (no links) 

Report form for errors/updates in this Swiss-Prot entry 



BLAST submission on 
BLAST ExPASy/SIB 

orat NCBI (USA) 



PS 



ScanProsite , MotifScan 




Sequence analysis tools: ProtParam , ProtScale » 
Compute pI/Mw - PeptideMass , PeptideCutter , 
Dotlet (Java) 

Search the SWISS-MODEL Repository 
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Site Map 


Search ExPASv 


Contact us 
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Go 


i Clear 




Search | Swiss-Prot/TrEMBL for (invasin 



NiceProt 
View of 
Swiss- 
Prot: 
P10858 



Printer-friendly view 



Submit update 



Quick BlastP search 



[Entry hifb] [Name and [References] [Comments] [Cross-references [Keywords] 

[Features] [ Sequence ] [ Tools ] 

Note: most headings are clickable, even if they don't appear as links. They link to the user manual or other documents . 
Entry information 



YADAYERPS 
P10858 

None 

Release 11, July 1989 
Release 11, July 1989 



Entry name 

Primary accession number 
Secondary accession numbers 
Entered in Swiss-Prot in 
Sequence was last modified in 
Annotations were last modified in Release 44, July 2004 
Name and origin of the protein 
Protein name 
Synonym 
Gene name 



Invasin [Precursor] 
Outer membrane adhesin 
Name: yadA 

Synonyms: yopA, invA, yopl 
From Yersinia pseudotuberculosis [TaxID: 633 ] 

Encoded on Plasmid pIB 1 . 

Taxonomy Bacteria ; Proteobacteria ; Gammaproteobacteria ; Enterobacteriales ; 

Enterob acteriaceae ; Yersinia . 

References 

[ 1 ] SEQUENCE FROM NUCLEIC ACID. 
STRAIN=YPIII; 

DOI= 10.1038/334S22a0 :MEDLINE=88302441:PubMed=3043229 [ NCBL ExPASv , EBI, Israel 
Japan ] 

Rosqvist R. , SkurnikM. , Wolf-WatzH. ; 

"Increased virulence of Yersinia pseudotuberculosis by two independent mutations."; 
Nature 334:522-525(1988). 
[2] SEQUENCE FROM NUCLEIC ACID. 
STRAIN=YPIII; 

MEDLINE=89343638;PubMed=2761389 [NCBL ExPASv . EBL Israel Japan ] 
Skurnik M. . Wolf-WatzH. ; 

"Analysis of the yopA gene encoding the Yopl virulence determinants of Yersinia spp."; 
Mol. Microbiol. 3:517-529(1989). 
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Comments 

• FUNCTION: Invasin is a protein that allows enteric bacteria to penetrate cultured mammalian 
cells. The entry of invasin in the cell is mediated by binding several beta-1 chain integrins. This 
protein is an adhesin forming a fibrillar matrix on the cell surface. 

• SUBCELLULAR LOCATION. Outer membrane. 
Copyright 

This Swiss-Prot entry is copyright. It is produced through a collaboration between the Swiss Institute of Bioinformatics and 
the EMBL outstation - the European Bioinformatics Institute. There are no restrictions on its use by non-profit institutions as 
long as its content is in no way modified and this statement is not removed. Usage by and for commercial entities requires a 
license agreement (See http://www.isb-sib.ch/announce/ or send an email to license(5>,isb-sib.ch ) 

Cross-references 

X13883; CAA32088.1; -.[ EMBL / GenBank / DDBJ ] [ CoDingSequence ] 
S04534; S04534. 
IPR008126 : Adhesion. 
IPR008640 ; Hep_Hag. 
IPR008635 ; HIM. 
IPR0Q5594 : YadA_C. 
Graphical view of domain structure . 
PF05658 ; Hepjiag; 4. 
PF05662 : HIM; 1. 
PF03895 : YadA; 1. 

Pfam graphical view of domain structure . 
PRO 1756 ; OMADHESIN. 

[ Domain structure / List of seq. sharing at least 1 domain ] 
[ Family / Alignment / Tree ] 
P10858. 



EMBL 
PIR 



InterPro 



Pfam 



PRINTS 
ProDom 
HOBACGEN 
BLOCKS 
ProtoNet 
ProtoMap 
PRESAGE 
DIP 

ModBase 
SMR 

SWISS-2DPAGE 
UniRef 
Keywords 

Outer membrane ; Plasmid ; Signal; Virulence . 

Features 



P10858 . 
P10858 . 
P10858 . 
P10858 . 
P10858 . 

P10858 ; EE2C55FB12B183D4. 
Get region on 2D PAGE . 

View cluster of proteins with at least 50% / 90% identity. 



ft 



Feature table viewer 



Key From To Length Description 

SIGNAL 1 25 25 

CHAIN 



26 



434 



409 



Invasin . 



Sequence information 



Length: 434 AA [This is the Molecular weight: 45054 Da 
length of the unprocessed [This is the MW of the 



CRC64: EE2C55FB12B183D4 [This 
is a checksum on the sequence] 
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P10858 in FASTA 
format 



View entry in original Swiss-Prot format 

View entry in raw text format (no links ) 

Report form for errors/updates in this Swiss-Prot entry 



BLAST submission on 
BLAST ExPASy/SIB 

or at NCBI (USA) 




ScanProsite , MotifScan 




Sequence analysis tools: ProtParam , ProtScale , 
Compute pI/Mw , PeptideMass , PeptideCutter , 
Dotlet (Java) 

Search the SWISS-MODEL Repository 
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ExPASy BLAST2 Interface 



Page 1 of 62 



ExPASy Home page Site Map Search ExPASy 



Contact us 



Proteomics tools 



Swiss-Prot | 



Search |Swiss-Prot/TrEMBL "CT forjVirG 



Welcome to the SIB BLAST Network Service 

If results of this search are reported or published, please mention that 
the computation was performed at the SIB using the BLAST network service. 
The SIB BLAST network service uses a server developed at SIB and the NCBI 
BLAST 2 software. 

In case of problems, please read the online BLAST help . 

If your question is not covered, please contact < helpdesk@expasy.org > . 



NCBI BLAST program reference [ PM ID : 9254694] : 

Altschul S.F., Madden T.L., Schaffer A. A. , Zhang J., Zhang Z., Miller W. , 
Lipman D.J. Gapped BLAST and PSI -BLAST: a new generation of protein 
database search programs. Nucleic Acids Res. 25:3389-3402(1997). 



Query length: 682 AA (of which 2% low- complexity regions filtered out) 
Date run: 2004-09-14 18:42:59 UTC+0100 on sib-gml . unil . ch 
Program: NCBI BLASTP 1 . 5 . 4 -Paracel [2003-06-05] 
Database : EXPASY/UniProt 

1,568,464 sequences; 501,960,298 total letters 



Taxonomic view ! NiceBlast view Printable view 



List of potentially matching sequences 

Send selected sequences to |Clustal W (multiple alignment) [ ^1 j Submit Query j 



□ Include query sequence 

Db AC Description Score E-value 

D tr Q51112 Similar to Shigella flexneri VirG protein (Fragment) [... 1367 0.0 

□ tr Q9JTL4 Putative virulence associated protein [NMA1725] [Neiss... 1043 0.0 

□ tr Q6Q509 Lav (Fragment) [Haemophilus influenzae] 962 0.0 
[j tr Q93N95 Lav (Fragment) [lav] [Haemophilus influenzae] 954 0.0 
C tr Q93N94 Las (Fragment) [las] [Haemophilus influenzae biotype a... 663 0.0 
G tr Q9JPL5 Autotransporter A [autA] [Neisseria meningitidis] 389 e-107 
n tr Q9ZF58 Virulence -associated protein VapA [vapA] [Neisseria me... 388 e-106 
D tr Q9JSR2 Putative virulence associated protein [vapA] [Neisseri... 387 e-106 

□ tr Q6J5G0 Putative virulence-associated protein (Fragment) [Haem. . . 219 2e-55 

□ tr Q8X6C1 Putative beta-barrel outer membrane protein [z0402] [E. . . 142 2e-32 
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AidA-I adhesin-like protein [ECs0362] [Escherichia col... 
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Q6KCW6 YapH homolog [yapH] [Escherichia coli] 
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137 


9e 


-31 


n 


tr 


Q7BCK4 


VirG [virG] [Shigella flexneri] 


134 


4e 


-30 


□ 


tr 


Q99Q93 


IcsA (VirG), outermembrane protein exposed to the bact . . . 


134 


4e 


-30 




tr 


Q52298 


Virulence-associated VirG [Plasmid pMYSH6000] 


134 


4e 


-30 


□ 


tr 


Q9JMS3 


YchA protein [ychA] [Escherichia coli] 


134 


6e 


-30 


n 


tr 


Q6WE2 7 


IcsA [icsA] [Shigella flexneri] 


134 


7e 


-30 


□ 


tr 


Q8ZHA1 


Putative autotransporter protein [yapH] [Yersinia pestis] 


130 


le 


-28 


□ 


tr 


Q8CZU2 


Putative autotransporter adhesin [yapH] [Yersinia pestis] 


130 


le 


-28 


n 


tr 


Q9F289 YapD protein [yapD] [Yersinia pestis] 


130 


le 


-28 


□ 


tr 


Q9F2 85 YapH protein [yapH] [Yersinia pestis] 


130 


le 


-28 


1 — i 


tr 


Q74QP7 


Putative autotransporter protein [yapH] [Yersinia pestis] 


13 0 


le 


-28 


□ 


tr 


Q9F286 


YapG protein (Putative ATP-binding transport component... 


129 


2e 


-28 


□ 


tr 


Q74RV9 


Putative autotransporter protein [YP2907] [Yersinia pe... 


129 


2e 


-28 


□ 


sp 


P77286 


YDEU_ECOLI Hypothetical protein ydeU [ydeU] [Escherich . . . 


129 


2e 


-28 


□ 


tr 


Q8XAY3 


Putative ATP-binding component of a transport system a... 


129 


2e 


-28 


□ 


tr 


Q7AE01 


Putative ATP-binding component of a transport system a... 


129 


2e 


-28 


□ 


tr 


Q83LF4 Hypothetical protein SF1157 [SF1157] [Shigella flexneri] 


128 


4e 


-28 


n 


tr 


Q7UCU4 


Hypothetical protein [S1242] [Shigella flexneri] 


128 


4e 


-28 


□ 


tr 


Q9CKA8 


Hypothetical protein PM1717 [PM1717] [Pasteurella mult... 


126 


le 


-27 


□ 


tr 


P75997 


Putative part of putative ATP-binding component of a t... 


118 


3e 


-25 


n 


sp 


Q03155 


AIDA_ECOLI Adhesin aidA-I precursor [aidA-I] [Escheric . . . 


118 


4e 


-25 


n 


tr 


Q7CPG9 


Putative autotransported protein [misL] [Salmonella ty. . . 


110 


7e 


-23 


n 


tr 


Q9Z625 


MisL [misL] [Salmonella typhimurium] 


110 


7e 


-23 


□ 


sp 


P45508 YFAL ECOLI Hypothetical protein yfaL precursor [yfaL] . . . 


96 


2e 


-18 


□ 


tr 


Q8ZN57 


Similar to the C-terminal region of AIDA [shdA] [Salmo. . . 


96 


3e 


-18 


n 


tr 


Q8CW7 


Hypothetical protein yfaL [yfaL] [Escherichia coli 06] 


96 


3e 


-18 


n 


tr 


Q9XCJ4 


ShdA [shdA] [Salmonella typhimurium] 


96 


3e 


-18 


n 


tr 


Q8XE2 8 


Putative ATP-binding component of a transport system [... 


93 


le 


-17 


□ 


tr 


Q7AC44 


Putative ATP-binding component of a transport system [... 


93 


le 


-17 


r 


tr 


Q9FCW0 Hypothetical protein [Kluyvera ascorbata] 


74 


9e 


-12 


□ 


tr 


Q6IU2 5 


YdeU (Fragment) [Escherichia coli B] 


70 


le 


-10 


□ 


tr 


Q88LP9 


Outer membrane autotransporter [PP1880] [Pseudomonas p. . . 


66 


2e 


-09 


□ 


tr 


Q884S7 


Autotransporter, putative [PSPTO2 011] [Pseudomonas syr. . . 


66 


2e 


-09 


□ 


tr 


Q7VYJ2 


Autotransporter [BP1344] [Bordetella pertussis] 


65 


3e 


-09 


n 


tr 


Q7WIM1 


Putative autotransporter [BB2830] [Bordetella bronchis . . . 


64 


7e 


-09 


n 


tr 


Q883X2 


Autotransporter, putative [PSPT02225] [Pseudomonas syr. . . 


64 


9e 


-09 


□ 


tr 


Q7WIB0 


Putative autotransporter [BB2941] [Bordetella bronchis... 


55 


3e 


-06 


□ 


tr 


Q7W6E1 


Putative autotransporter [BPP2 975] [Bordetella paraper. . . 


55 


3e 


-06 


□ 


tr 


Q7WQ8 0 


Autotransporter [BB0452] [Bordetella bronchiseptica (A... 


54 


le 


-05 


□ 


tr 


Q7WHU5 


Autotransporter [BB3111] [Bordetella bronchiseptica (A. . . 


54 


le 


-05 


□ 


tr 


Q7WC76 


Autotransporter [BPP0452] [Bordetella parapertussis] 


54 


le 


-05 


□ 


tr 


Q8W95 


Vag8 protein (Autotransporter) [vag8-2] [Bordetella pe... 


52 


4e 


-05 


n 


tr 


066044 


Vag8 [vag-8] [Bordetella pertussis] 


52 


4e 


-05 
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□ 


tr 


Q7WQ83 


Autotransporter [BB0450] [Bordetella bronchiseptica (A. . . 


52 


5e 


-05 


n 


tr 


Q7WC7 9 


Autotransporter [BPP0449] [Bordetella parapertussis] 


52 


5e 


-05 


n 


tr 


Q7W9Y1 


Autotransporter [BPP1618] [Bordetella parapertussis] 


52 


5e 


-05 


□ 


tr 


Q9F2 90 


YapC protein (Putative autotransporter) (Putaive autot . . . 


51 


6e 


-05 


r 


tr 


Q88ID0 


Outer membrane autotransporter [PP3069] [Pseudomonas p. . . 


51 


8e 


-05 


□ 


sp 


P76017 


YCGV_EC0LI Hypothetical protein ycgV [ycgV] [Escherich. . . 


50 


le 


-04 


r 


tr 


Q881W9 


Autotransporter, putative [PSPT02763] [Pseudomonas syr . . . 


50 


2e 


-04 


r 


tr 


Q7WL85 


Autotransporter [vag8] [Bordetella bronchiseptica (Ale... 


50 


2e 


-04 


□ 


tr 


Q7W7U5 


Autotransporter [vag8] [Bordetella parapertussis] 


50 


2e 


-04 


□ 


tr 


Q7WHU6 


Autotransporter [BB3110] [Bordetella bronchiseptica (A. . . 


49 


2e 


-04 


□ 


tr 


086135 


Tracheal colonization factor precursor [tcfA2] [Bordet . . . 


49 


2e 


-04 


□ 


tr 


Q8W97 


Tcf protein [tcfA3] [Bordetella pertussis] 


4 9 


2e 


-04 


□ 


tr 


Q9F4B3 


BapB protein [bapB] [Bordetella pertussis] 


49 


2e 


-04 


□ 


tr 


Q8GB90 


Tracheal colonization factor [tcf A] [Bordetella pertus . . . 


49 


2e 


-04 


□ 


tr 


Q45343 


Tracheal colonization factor [tcf A] [Bordetella pertus... 


49 


2e 


-04 


□ 


tr 


Q6U94 8 


Tracheal colonization factor protein [tcf A] [Bordetell . . . 


49 


2e 


-04 


□ 


tr 


Q7WKS2 


Putative autotransporter [bapC] [Bordetella bronchisep. . . 


48 


5e 


-04 


□ 


tr 


Q83RP3 


Putative adhesion and penetration protein [SF1205] [Sh. . . 


47 


0. 


001 




tr 


Q7WK68 


Autotransporter [BB2270] [Bordetella bronchiseptica (A. . . 


4.7 


0. 


001 


□ 


tr 


Q7WD6 


Autotransporter (Pseudogene) [bapC] [Bordetella pertus... 


47 


0. 


001 


□ 


tr 


Q7C1Y6 


Putative adhesion and penetration protein [S1289] [Shi... 


47 


0. 


001 


□ 


tr 


086044 


Putative autotransporter (BapC protein) [bapC] [Bordet... 


47 


0. 


001 


□ 


tr 


Q7W8V8 


Autotransporter [BPP2022] [Bordetella parapertussis] 


47 


0. 


001 


□ 


tr 


Q7W1F5 


Autotransporter [BPP0735] [Bordetella parapertussis] 


46 


0. 


002 


□ 


tr 


Q7WP64 


Autotransporter [BB0821] [Bordetella bronchiseptica (A. . . 


45 


0. 


003 


□ 


tr 


Q6G430 Hypothetical protein [BH05490] [Bartonella henselae (R. . . 


44 


0. 


008 


□ 


tr 


Q7W7D4 


Putative autotransporter [bapC] [Bordetella parapertus . . . 


44 


0 . 


008 


r 


tr 


Q9JMS5 


YcbB protein [ycbB] [Escherichia coli] 


44 


0. 


013 


n 


tr 


Q9XD84 


TibA [tibA] [Escherichia coli] 


43 


0 . 


017 


□ 


tr 


Q6G428 


Hypothetical protein [BH05510] [Bartonella henselae (R. . . 


42 


0. 


029 


n 


tr 


Q83JR2 


Exported serine protease SigA [sigA] [Shigella flexneri] 


42 


0 . 


029 


□ 


tr 


Q9L8L1 


Exported serine protease SigA [sigA] [Shigella flexner. . . 


42 


0. 


029 


□ 


tr 


Q7C013 


Serine protease [sigA] [Shigella flexneri] 


42 


0. 


029 


□ 


tr 


Q8UJW1 


Autotransporter protein [yapE] [Agrobacterium tumef aci . . . 


40 


0 


.14 


□ 


tr 


Q7D396 


AGR_pAT_52 8p [AGR_j>AT_52 8] [Agrobacterium tumef aciens . . . 


40 


0 


. 14 


□ 


tr 


Q7WK90 


Autotransporter [phg] [Bordetella bronchiseptica (Alca. . . 


40 


0 


.19 


n 


tr 


Q7W8X9 


Autotransporter [phg] [Bordetella parapertussis] 


40 


0 


.19 


□ 


tr 


Q84GK0 


Secreted autotransporter protein EatA [eatA] [Escheric... 


40 


0 


.19 


□ 


tr 


Q8UJX1 


Autotransporter protein [bapA] [Agrobacterium tumef aci .. . 


39 


0 


.25 


□ 


tr 


Q7D3A2 


AGRjpAT_511p [AGR_pAT_511] [Agrobacterium tumefaciens . . . 


39 


0 


.25 


□ 


tr 


Q6MTD5 


Hypothetical transmembrane protein [MSC_0473] [Mycopla. . . 


39 


0 


.32 


n 


sp 


P45386 


IGA4_HAEIN Immunoglobulin Al protease precursor (EC 3.... 


39 


0 


.42 


n 


tr 


Q9Z5R3 


Phg protein (Autotransporter) [phg] [Bordetella pertus... 


38 


0 


.55 




tr 


Q6KD18 


Antigen 43 precusor Sap [sap] [Escherichia coli] 


31 


0 


.71 


□ 


tr 


Q8FDW4 


Aecreted auto transpoter toxin [sat] [Escherichia coli... 


3 7 




1.2 
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Graphical overview of the alignments 

| ... , I to resubmit your query after masking regions matching PRO SITE profiles 

uick here j Qr p£am HMMs 



(® Help ) (use ScanProsite for more details about PROSITE matches) 



Profile hits 
Pfan hits 
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Hatches on query sequence Hat 
Submission JS#^^^^^^^^^^^^^^»"^^^™«m"^"""^™i^^™^^^™«w^^m^w^bm 

1 500 1 

Q6Q599 L_m _ „ _r _ ^ n_: „ * . ;:„:_:_ r ; 53 

Q93N95 ii;:;;:;: 



Q93N94 * j - . . .. . „:.i„„t • . » - s _ .;. cro 
Q9JPL5 

Q9ZF58 . . . j 

Q9JSR2 „ _ _ „ . * „_j 

Q6J5G9 « . 1 . 

Q8X6C1 e::_:„: fmwrzz. s — . san mm 

Q7RH89 *r?.:::-: /_ ......... — . — . 

Q6KCH6 nzuL 

Q8FFF9 r .„„ r ~ 7 zzzz. — 

Q7BCK4 cll : „z _ ; 

Q52298 t:i;:.: ::;.: :: ;:;::. r i:.::::; : : v. ::. . ::. \ 

Q6ME27 7 ' .." . .. . .. 

Q74QP7 r i . . v r : i r — r i 

Q9F286 ; j 

ydeuHecoli mm |5t 7 777.77 S S i 

Q8XRY3 mx: ... i / i:.....:' :;:::i:.ij 

Q7AE81 ^ l„ : ;. ;:; ; ia i: : i 

P75997^^^^ „_ ; 

Q7CPG9 c::; ::.:;;:: ; : :;:::::: i::: zzz:zz ;r ; :: ::::::::; 

YFflL_EC0LI l „ : :. ; a mm 

Q8ZN57 li.. „ : - : : ' zzzzj b 

Q8CVV7 i::;::; ;.:;::;..:.; :; :zzzzz:..::zzzzzs., ;:::::; :j iczd 

Q9XCJ4 Li: 'zz : ' mzm 

Q8XE28 i^^— :=: : : : ; zzzn nzzi 

Q7fiC44 tzzizzzzzzzz . :n pm 

Q9FCH8 mmii - s i mm 

Q6IU25 □ 

Q88LP9 rrr^— — — = ~ „ j _ |g 

Q7VYJ2 ™r: r\ rrrj ^ 

Q7MQ8G Bm2 ™ B " ' ""^'^p— ^7777::. zzz: zzzzi *™ t; :..;i 

Q7HHU5 i 



i. .. J 



Q7HC76 

066844 i . * „ j , . ; . zzzzz . 3 . „m l::::i 

Q7H9Y1 ^ S 1 ^^ I ;r ;rj 

Q88ID8 * < " * _ 7 ~:i i i 

Q881H9 M ' Izzizzzzzzzzzm IEj . 

Q7ML85 e-i aas: us; : r: :: . sac :rnn l :::::;) 

Q7M7U5 . i::.::: :: .:: :;.:., :;:: :::: zzzzzzz. zzz: Z3 cm 

Q7MHUB cr :: — sa t~ i 

086135 nCT^^;^-'. -^xrai i 

Q8VV97 rarpwaF-r^^ m 

Q9F4B3 e: — aw i^o 

Q8GB98 
Q45343 
Q6U948 
Q7HKS2 
Q83RP3 
Q7HK68 
Q7VVD6 
Q7C1Y6 
086844 
Q7M8V8 



L- 11 



Q7MP64 - : : -- - T — , L -:n 

Q6G438 nzzi 

Q7H7D4 rr~: ~vr:.:i:-*"i : ^i:~rxzzzzii.x~z::- :::.r "r " - ^,-z^zzn czzzj 

Q9JHS5 t^mm-r - zzzzi ^ ^ ^ rzzzn 
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Alignments 



tr Q51112 Similar to Shigella flexneri VirG protein (Fragment) 682 
[Neisseria AA 
meningitidis] align 

Score = 1367 bits (3538), Expect = 0.0 

Identities = 669/682 (98%) , Positives = 669/682 (98%) 

Query : 1 QRKIYYETRXXXXXXXXXXXXXIVSLFFSILYTSPLLAVDYVYDKTKLTNDEITRLKKLR 60 

QRKIYYETR IVSLFFSILYTSPLLAVDYVYDKTKLTNDEITRLKKLR 
Sbjct : 1 QRKIYYETRSKQASKQKFKKSFIVSLFFSILYTSPLLAVDYVYDKTKLTNDEITRLKKLR 60 

Query: 61 DKTSEYWKKETYLITEDNPKVPPFPALYPRTYQFENINNSKKISFYDQEYTEGYLVGFAR 12 0 

DKTSEYWKKETYLITEDNPKVPPFPALYPRTYQFENIMNSKKISFYDQEYTEGYLVGFAR 
Sbjct : 61 DKTSEYWKKETYLITEDNPKVPPFPALYPRTYQFENINNSKKISFYDQEYTEGYLVGFAR 12 0 

Query: 121 GLGVAKRNGDTEEQIRKYFKECFNSNTKIRDYSTCQAEKFGSHPLIVKSHIFSLGPKIKN 18 0 

GLGVAKRNGDTEEQIRKYFKECFNSNTKIRDYSTCQAEKFGSHPLIVKSHIFSLGPKIKN 
Sbjct: 121 GLGVAKRNGDTEEQIRKYFKECFNSNTKIRDYSTCQAEKFGSHPLIVKSHIFSLGPKIKN 180 

Query: 181 SHINSE I LS VGNYTEWANQVIHHI ENYVS FAAHLYSGLDPFHYIEVTDNSHVIGQTI SLD 24 0 

SHINSEILSVGNYTEWANQVIHHIEMYVSFAAHLYSGLDPFHYIEVTDNSHVIGQTISLD 
Sbjct: 181 SHINSE I LSVGNYTEWANQVIHHI ENYVS FAAHLYSGLDPFHYIEVTDNSHVIGQTI SLD 240 

Query: 241 EFRLENSLWEPRWDSNVGKLKTTNADIRFNTKSESLLVKEDYAGGARFRFAYDPKEAKNT 300 

E FRLENSLWE PRWDSNVGKLKTTNAD I RFNTKSES LLVKED YAGGARFRFAYD PKE AKNT 
Sbjct: 241 EFRLENSLWEPRWDSNVGKLKTTNADIRFNTKSESLLVKEDYAGGARFRFAYDPKEAKNT 300 

Query: 301 ALIFEKNVTGTSDIIFENPIDDLKSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQ 360 

ALIFEKNVTGTSDIIFENPIDDLKSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQ 
Sbjct: 301 ALIFEKNVTGTSDIIFENPIDDLKSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQ 360 

Query: 3 61 RPEGFLPKCKNAMIWRFMHNRLKPPIPYCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSN 42 0 

RPEGFLPKCKNAMIWRFMHNRLKPPIPYCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSN 
Sbjct: 361 RPEGFLPKCKNAMIWRFMHNRLKPPIPYCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSN 420 

Query : 421 QWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQLS IGLMGGQAEQRSTFHNPDTDNLTTGN 4 8 0 

QWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGN 
Sbjct: 421 QWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQLS IGLMGGQAEQRSTFHNPDTDNLTTGN 480 

Query: 4 81 VKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDGTERFTSKGITASIEAGYN 54 0 

VKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDGTERFTSKGITASIEAGYN 
Sbjct: 481 VKGFGAGVYATWHQLQDKQTGAYADS WMQ YQRFRHR INTEDGTERFTS KG I TAS I E AG YN 540 

Query: 541 ALLAEHFTKKGNS LRVYLQPQAQLTYLGVNGKF SD S ENAHVNLLGS RQLQTRVGVQAKAQ 600 

ALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENAHVNLLGSRQLQTRVGVQAKAQ 
Sbjct: 541 ALLAEHFTKKGNS LRVYLQPQAQLTYLGVNGKFSDSENAHVNLLGSRQLQTRVGVQAKAQ 600 

Query: 601 FSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTAIESQLGVAVKIKSHLTLQAT 660 

FSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTAIESQLGVAVKIKSHLTLQAT 
Sbjct: 601 FSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTAIESQLGVAVKIKSHLTLQAT 660 

Query: 661 FNRQTGKHHQAKQGALNLQWTF 682 

FNRQTGKHHQAKQGALNLQWTF 
Sbjct: 661 FNRQTGKHHQAKQGALNLQWTF 682 
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tr Q9JTL4 Putative virulence associated protein [NMA1725] [Neisseria 642 AA 
meningitidis (serogroup A) ] 

align 

Score = 1043 bits (2698), Expect =0.0 

Identities = 529/650 (81%), Positives = 568/650 (87%), Gaps = 12/650 (1%) 

Query: 3 7 LAVDYVYDKTKLTNDEITRLKKLRDKTSEYWKKETYLITEDNPKVPPFPALYPRTY- -QF 94 

+AVDYVYDKTKLT+DE ITRLKKLRD+ SEYWK-f-ETY I +N P PAL+P+ F 
Sbjct: 1 MAVT)YVTDKTKliTDDEITRLKKLRDRNSEYWKEETYHIKSNNRVYPNIPALFPKHPFDPF 60 

Query : 95 ENINNSKKISFYDQEYTEGYLVGFARGLGVAKRNGDTEEQIRKYFKECFNSNTKIRDYST 154 

ENINNSK+ISFYD+EYTE YLVGFA+GLGVAKRNG+TE+ IR+YFKEC N+ D T 

Sbjct : 61 ENINNSKRISFYDKEYTEDYLVGFAQGLGVAKRNGETEKPIRQYFKECLNTGKYSDD--T 118 

Query: 155 CQAEKFGSHPLIVKSHIFSLGPKIKNSHINSEILSVGNYTEWANQVIHHIENYVSFAAHL 214 

C++++ S P V+S IF+L KIKNSHINSEILSVGNY EW ++ + S HL 
Sbjct: 119 CKSQQ- -SIPT-VRSDIFALNTKIKNSHINSEILSVGNYIEWLRPTLNQLS SSQEHL 172 

Query: 215 YSGLDPFHYIEVTDNSHVIGQTISLDEFRLENSLWEPRWDSNVGKLKTTNADIRFNTKSE 2 74 

YS +DPFHYIEVTDNSHVIGQTISLDEFRLENSLWEPRWDS+VG+LKTTNADIRFNTKSE 
Sbjct: 173 YSDVDPFHYIEVTDNSHVIGQTISLDEFRLENSLWEPRWDSDVGELKTTNADIRFNTKSE 232 

Query: 275 SLLVKEDYAGGARFRFAYDPKEA- -KNTALIFEKNVTGTSDIIFENPIDDLKSLDGHQII 332 

SLLVKEDYAGGARFRFAY K+ + L FEKN+TGTSDIIFENPIDDLKSLDGHQII 
Sbjct: 233 SLLVKEDYAGGARFRFAYGLKDKVPETPVLTFEKNITGTSDIIFENPIDDLKSLDGHQII 292 

Query: 3 33 KTOGTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKNAMIWRFMHNRLKPPIPYCALR 3 92 

KVNGTADKHAFRLSGKHQKGIYTLSLQQRPEGF K + + + ALR 

Sbjct: 293 KVNGTADKHAFRLSGKHQKGIYTLSLQQRPEGFFTKVQERDDISIYAQQAQAANTLFALR 352 

Query: 3 93 LNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQ 452 

LN+KNSD I FDRTL PRKGLWLRVI DGHSNQWVQGKTAPVE RKGVQLGGEVFTWQNE SNQ 
Sbjct: 353 LNDKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPVESNRKGVQLGGEVFTWQNESNQ 412 

Query: 453 LSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQR 512 

LS+GLM GQAEQRSTF NPD TDNLTTGNVKGFG AG + Y ATWHQL QD KQTG A YAD S W -f Q YQR 
Sbjct: 413 LSVGLMSGQAEQRSTFRNPDTDNLTTGNVKGFGAGIYATWHQLQDKQTGAYADSWVQYQR 472 

Query: 513 FRHRINTEDGTERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGWGK 572 

FRHRINTED TERFTSKGITASIEAGYNALLAEHFTKKGN +R YLQPQAQLTYLGVNGK 
Sbjct: 473 FRHRINTEDATERFTSKGITASIEAGYNALLAEHFTKKGNRVRFYLQPQAQLTYLGVNGK 532 

Query: 573 FSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGER 632 

FSDSENAHVNLLGSRQLQ+RVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGER 
Sbjct: 53 3 FSDSENAHVNLLGSRQLQSRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGER 5 92 

Query: 633 RVINNKTAIESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQWTF 682 

R + 1 NNKTAI ES QLGVAVKI KS HLTLQATFNRQTGKHHQAKQG ALNLQWT F 
Sb j ct : 5 93 RM INNKTAI E S QLGVAVKI KSHLTLQATFNRQTGKHHQAKQGALNLQWTF , 642 



tr Q6Q509 Lav (Fragment) [Haemophilus influenzae] 695 AA 

align 
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Score = 962 bits (2486), Expect = 0.0 

Identities = 494/665 (74%), Positives = 547/665 (81%), Gaps = 13/665 (1%) 

Query: 2 3 IVSLFFSILYTSPLLAV-DYVYDKTKLTNDEITRLKKLRDKTSEYWKKETYLITEDNPKV 81 

J+SLFFSILYTSPLLAV DY YD +KL.TN+ + I RLKKLRD+ SEYWKKETYL+ 
Sbjct: 39 IISLFFSILYTSPLLAWDYTYDNSKLTNEQIERLKKLRDRNSEYWKKETYLLKSPPSNF 98 

Query: 82 PPFPALYPR - - TYQFENINNSKKI SFYDQEYTEGYLVGFARGLGVAKRNGDTEEQIRKYF 13 9 

P P L+P+ + FENI+NSK ISFYDQ+YTE YLVGFA-fG G VAKRNGD TEE +RKYF 
Sbjct: 99 PDIPVLFPKDSSVPFENIDNSKAISFYDQKYTEDYLVGFAQGFGVAKRNGDTEEPVRKYF 15 8 

Query: 140 KECFNSNTKIRDYSTCQAEKFGSHPLIVKSHIFSLGPKIKNSHINSEILSVGNYTEWANQ 199 

KEC N+ D C+A S + +KS IF+ P IKNSHINSEIL+VGNYTE 
Sbjct: 159 KECLNTGNYNND - - NCKANPLAS S - VS I KSD I FTPRPTI KNSHINSE I LAVGNYTEL 212 

Query: 2 00 VIHHIENYVSFAAHLYSGLDPFHYIEVTDNSHVIGQTISLDEFRLEWSLWEPRWDSNVGK 259 

++ + S+A HLYS DP + V DNSHVIGQTI L +L NSLWEPRW+SN+ 
Sbjct: 213 MLAAQPSASSWAEHLYS - -DPGLSLTVQDNSHVIGQTIDLGFLQLTNSLWEPRWNSNIDY 270 

Query: 260 LKTTNADIRFMTKSESLLVKEDYAGGARFRFAYDPKEA- -KNTALIFEKNVTGTSDIIFE 317 

L T NA+IRFNTK+ESLLV+ YAGGARFRFAYD K+ + L FE+N+TGTSDIIFE 
Sbjct: 271 LVTENAEIRFNTKNESLLVEGYYAGGARFRFAYDLKDKAPETPVLTFEQNITGTSDIIFE 330 

Query: 318 NPIDDLKSLDGHQIIKWGTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKNAMIWRF 377 

NPIDDLKSLDGHQI+KVWGTADK+AFRLSGK++KGIYTLSLQQR EGF K + 
Sbjct: 331 NPIDDLKSLDGHQIVKVNGTADKNAFRLSGKYRKGIYTLSLQQRLEGFFTKLQERDDIAI 390 

Query: 378 MHNRLKP P I P YCALRLNNKNSD I FDRTLPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKGV 437 

+ + ALRLN+KNSDIFDRTLPRKGLWLRVIDGHS-f-QWVQGKTAP+EGYRKGV 
Sbjct: 391 YAQQAQAANTLFALRLNDKNSDIFDRTLPRKGLWLRVIDGHSSQWVQGKTAPLEGYRKGV 450 

Query: 43 8 QLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQD 497 

QLGGEVFTWQNESNQ S+GLMGGQAEQRSTF NPDTDNLTTGN+KG GAGVYATWHQLQD 
Sbjct: 451 QLGGEVFTWQNESNQFSVGLMGGQAEQRSTFRNPDTDNLTTGNMKGLGAGVYATWHQLQD 510 

Query: 498 KQTGAYADS WMQ YQRFRHR INTEDGTERFTS KG I TAS I EAG YNALLAEHFTKKGNS LRVY 557 

KQTGAY DSW QYQR FRHR INTEDGTERFTS KG I TAS I EAG YNALLAEH T KG +R Y 
Sbjct: 511 KQTGAYVD SWAQ YQRFRHR INTEDGTERFTS KG I TAS I EAG YNALL AEHLTGKGTQ I RF Y 570 

Query: 558 LQPQAQLTYLGVNGKFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNA 617 

LQPQAQLTYLGVNGKFSDSEN+ VNLLGSRQLQ+RVGVQAKAQF L KNI I+PFAAVN 
Sbjct: 571 LQPQAQLTYLGVNGKFSDSENSQVNLLGSRQLQSRVGVQAKAQFLLNKNIVIQPFAAVNT 63 0 

Query: 618 LYHNKPFGVEMDGERRVINNKTAIESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALN 677 

LYH+KPFGVE-fDGERRVINNKTAIESQ G+ AVKIKSHLTLQATFNRQTGKHH AKQGALN 
Sbjct: 631 LYHSKPFGVEIDGERRVINNKTAIESQFGIAVKIKSHLTLQATFNRQTGKHHHAKQGALN 690 

Query: 67 8 LQWTF 682 
LQWTF 

Sbjct: 691 LQWTF 695 



tr Q93N95 Lav (Fragment) [lav] [Haemophilus influenzae] 692 AA 

align 



Score = 954 bits (2467), Expect = 0.0 
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Identities = 487/664 (73%), Positives = 543/664 (81%), Gaps = 13/664 (1%) 

Query : 2 3 IVSLFFSILYTSPLLAVDYVYDKTKLTNDEITRLKKLRDKTSEYWKKETYLITEDNPKVP 82 

I+SL S LY+ S PLLAVD YVYDKTKLT+DE I TRLKKLRD+ SEYWK+ETY I P 
Sbjct : 38 ILSLLVSALYSSPLLAVDYVYDKTKLTDDEITRLKKLRDRDSEYWKEETYFIKSTPQSPP 97 

Query : 83 PFPALYPRTY- -QFENINNSKKISFYDQEYTEGYLVGFARGLGVAKRNGDTEEQIRKYFK 140 

P L+P+ FENINNSK+ISFYD+E+TE YLVGFA-fG GVAKRNG + TEE +R+YFK 

Sbj Ct : 98 NIPRLFPKNSFDSFENINNSKEISFYDKEFTEDYLVGFAQGFGVAKRNGETEETVRQYFK 157 

Query: 141 ECFNSNTKIRDYSTCQAEKFGSHPLIVKSHIFSLGPKIKNSHINSEILSVGNYTEWANQV 200 

EC N+ D TC++ + + +KS IF+L +KNSHINSEIL4-VGNYT+ 

Sbjct: 158 ECLNTGKHSND-PTCKS- -YSTDAYNIKSDIFALNTIVKNSHINSEILAVGNYTKLMLSA 214 

Query: 201 IHHIENYVSFAAHLYSGLDPFHYIEVTDNSHVIGQTISLDEFRLENSLWEPRWDSNVGKL 260 

H + +A HLYS + + V DNSHVIGQTI L L NSLWEPRW+SN+ L 

Sbjct: 215 QHSS -TWAEHLYSNAELS- -LTVQDNSHVIGQTIDLGALILTNSLWEPRWNSNIDYL 268 

Query: 261 KTTNADIRFNTKSESLLVKEDYAGGARFRFAYD- -PKEAKNTALIFEKNVTGTSDIIFEN 318 

T NADIRFNTKSESLLVK. +YAGGARFRFAYD K + L FE+N+TGTSDIIFEN 
Sbjct: 269 ATENADIRFNTKSESLLVKGNYAGGARFRFAYDLQDKAPETPVLTFEQNITGTSDIIFEN 32 8 

Query: 319 PIDDLKSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKNAMIWRFM 37 8 

PID+LKSLDGHQI+KVNGTADK+AFRLSGK++KGIYTLSLQ+RPEGF K + 
Sbjct: 32 9 PIDNLKSLDGHQIVKVNGTADKNAFRLSGKYRKGIYTLSLQRRPEGFFTKVQERDNIAIY 3 88 

) 

Query: 379 HNRLKPPIPYCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQ 43 8 

+ + ALRLN+KNSDIFDRTLPRKGLWLRVIDGHS-f-QWVQGKTAP+EGYRKGVQ 
Sbjct: 389 AQQAQAANTLFALRLNDKNSDIFDRTLPRKGLWLRVIDGHSSQWVQGKTAPLEGYRKGVQ 44 8 

Query: 43 9 LGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDK 4 98 

LGGEVFTWQNESNQ S+ GLMGGQAEQRSTF NPDTDNLTTGN+KGFGAGVYATWHQLQDK 
Sbjct: 449 LGGEVFTWQNESNQFSVGLMGGQAEQRSTFRNPDTDNLTTGNMKGFGAGVYATWHQLQDK 508 

Query: 499 QTGAYADSWMQYQRFRHRINTEDGTERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYL 558 

QTGAY DSW+QYQRFRH INTEDGTERFTS KGI TAS I EAG YNALLAEH T KG +R YL 
Sbjct: 509 QTGAYIDSWVQYQRFRHHINTEDGTERFTSKGITASIEAGYNALLAEHVTGKGTQIRFYL 568 

Query: 55 9 QPQAQLTYLGVNGKFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNAL 618 

QPQAQLTYLGVNGKFSDSEN+ VNLLG RQLQ-f RVGVQAKAQF L KNI I+PFAAVN L 
Sbjct: 569 QPQAQLTYLGVNGKFSDSENSQVNLLGPRQLQSRVGVQAKAQFLLNKNIVIQPFAAVNTL 62 8 

Query: 619 YHNKPFGVEMDGERRVINNKTAIESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNL 678 

YH+KPFGVE+DGERRVINNKTAIESQ G+AVKIKSHLTLQATFNRQTGK H AKQGALNL 
Sbjct: 62 9 YHSKPFGVEIDGERRVINNKTAIESQFGIAVKIKSHLTLQATFNRQTGKRHHAKQGALNL 688 

Query: 679 QWTF 682 

QWTF 

Sbjct: 689 QWTF 692 



tr Q93N94 Las (Fragment) [las] [Haemophilus influenzae biotype 703 
aegyptius] AA 



align 



Score = 663 bits (1710), Expect = 0.0 
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Identities = 337/480 (70%), Positives = 386/480 (80%), Gaps = 5/480 (1%) 

Query: 2 04 IENYVSFAAHLYSGLDPFHYIEVTDNSHVIGQTISLDEFRLENSLWEPRWDSNVGKLKTT 2 63 

I+NY S GL +V+SVGT+L+ L+ +SLWEPR++S+V L+T 

Sbjct: 228 IDNYRSKFPDETRGLT LTVKNQSEVRGATLQLLKMVLQDSLWEPRFNSDVHHLETQ 283 

Query: 264 NADIRFNTKSESLLVKEDYAG-GARFRFAYDPKEAKNTALIFEKNVTGTSDIIFENPIDD 322 

NA+IRFN+ + L V E+Y G G+RF ++PKEA L F+K+VTGTS+I+FE PIDD 
Sbjct: 2 84 NANIRFNSTNTRLTVHENYQGDGSRFFIKFNPKEATQPVLTFDKDVTGTSNIVFEKPIDD 343 

Query: 32 3 LKSLDGHQIIKV^GTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKNAMIWRFMHNRL 3 82 

LKSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQRPEGF K + + 
Sbjct: 344 LKSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQRPEGFFTKVQERDDIAIYAQQA 403 

Query: 383 KPPIPYCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGE 442 

•f ALRLN+KNSDIFDRTLPRKGLWLR+I GH +Q VQGKTAPVEG RKG+QLGG+ 

Sbjct: 404 QAANTLFALRLNDKNSDIFDRTLPRKGLWLRLISGHLSQDVQGKTAPVEGNRKGIQLGGD 463 

Query: 44 3 VFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGA 502 

VF+ QN+ QLS GLMGGQAEQRSTF N DTDNLTTG++KGFGAG+YATWHQLQDKQTGA 
Sb j ct : 464 VFSLQNQDYQLSFGLMGGQAEQRSTFRNSDTDNLTTGSMKGFGAGIYATWHQLQDKQTGA 52 3 

Query: 503 YADSWMQYQRFRHRINTEDGTERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQA 562 

Y DSW+QYQRFRHRINTEDG ERFTS KG I TAS I EAG YN ALLAEH T+KG +R YLQPQA 
Sbjct: 524 YVDSWVQYQRFRHRINTEDGI ERFTSKGI TAS I EAG YNALLAEHLTEKGTQ I RF YLQPQA 583 

Query: 563 QLTYLGVNGKFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNK 622 

QLTYLGVNG +DS N+ WLLGSRQLQ+RVG QAKAQF+ 4- +PF A+N++Y K 

Sbjct: 584 QLTYLGVNGDLTDSGNSKVNLLGSRQLQSRVGAQAKAQFTFTNGVIFQPFVALNSIYQQK 643 

Query: 623 PFGVEMDGERRVINNKTAIESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQWTF 682 

PFGVE+D ERRVINNKTAIESQLGVA+KIKSHLTLQATFNRQTGKHH AKQGALNLQWTF 
Sbjct: 644 PFGVEIDNERRVINNKTAIESQLGVALKIKSHLTLQATFNRQTGKHHHAKQGALNLQWTF 703 



tr Q9JPL5 Autotransporter A [autA] [Neisseria meningitidis] 619 AA 

align 

Score = 389 bits (1000) , Expect = e-107 

Identities = 207/471 (43%) , Positives = 302/471 (63%) , Gaps = 24/471 (5%) 

Query: 223 YIEVTDNSHVIGQTISLDEFRLENSLWEPRWDSNVGKLKTT-NADIRFNTKSESLLVKED 281 

++++ + SHV GQ + L + L++SLWEPR S++ L+T+ NA IR NTK E L V +D 
Sbjct: 162 HVKIENKSHVAGQVLELTKMTLKDSLWEPRRHSDIHTLETSDNARIRLNTKDEKLTVHKD 221 

Query: 282 YAGGARFRF AYDP KE AKNTAL I FEKNVTGTSD 1 1 FENP IDDLKS LDGHQ 1 1 KVNGTADKH 341 

YAGGA F F YD +E+ AL FE V+G S ++ E ++LK+LDG ++I TAD 
Sbjct: 222 YAGGADFLFGYDVRESDEPALTFEDKVSGQSGWLERRPENLKTLDGRKLIAAK-TADSG 280 

Query: 342 AFRL SGKHQKG I YTL S LQQRPEGF LPKCKNAMIWR- -FMHNRLKPPIPYCAL 391 

+ +++G+Y L L+Q GF +P+ + + + + N L L 
Sbjct: 2 81 SLAFKQNYRQGLYELLLKQCEGGFCLGVQRLAIPEAEAVLYAQQAYAANTL FGL 334 

Query: 3 92 RLNNKNSD I FDRTLPRKGLWLRVIDGHSNQWVQGKTAP VEG YRKGVQLGGEVFTWQNESN 451 

R ++ D++ R+ LWLR I G S+Q ++G A +G+RKGVQ+GGEVF QNE + 

Sbjct: 335 RAADRGDDVYAADPSRQKLWLRFIGGRSHQNIRGGAA-ADGWRKGVQIGGEVFVRQNEGS 393 
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Query: 452 QLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQ 511 

+L+IG+MGG+A Q ++ + ++ G+G GVYA WHQL+DKQTGAY D W+QYQ 

Sbjct: 3 94 RLAIGVMGGRAGQHASVNGKG- -GAAGSDLYGYGGGVYAAWHQLRDKQTGAYLDGWLQYQ 451 

Query: 512 RFRHRINTEDGTERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNG 571 

RF+HRIN E + ER+ +KG TAS+E GYNAL+AE KGN++R YLQPQAQ TYLGVNG ' 
Sbjct: 452 RFKHRINDENRAERYKTKGWTASVEGGYNALVAEGIVGKGNNVRFYLQPQAQFTYLGVNG 511 

Query: 572 KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGE 631 

F+DSE V LLGS Q Q+R G++AK + F+L + + + PFAA N L+ +K FGVEMDGE 
Sbjct: 512 GFTDSEGTAVGLLGSGQWQS RAG I RAKTRFALRNGVNLQPFAAFNVLHRSKS FGVEMDGE 571 

Query: 632 RRVINNKTAIESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQWTF 682 

+ + + +TA+E + G+ K H+ + + + ++T K+ AL+L+W F 

Sbjct: 572 KQTLAGRTALEGRFG I EAGWKGHMS AR I GYGKRTDGD KEAALSLKWLF 619 



tr Q9ZF58 Virulence-associated protein VapA [vapA] [Neisseria 619 
meningitidis] AA 

align 

Score = 388 bits (997) , Expect = e-106 

Identities = 224/576 (38%), Positives = 338/576 (57%), Gaps = 46/576 (7%) 

Query: 130 DTEEQIRKYFKECFNSNTK IRDYSTCQAEKFGSHPLIVK- SHIFSLGPKI 178 

D ++KY + + N K + +D + E + + + ++I LGPK 

Sbjct: 67 DNMPWKKYITDTYGDNLKDAVKKQLQDLYKTRPEAWEENKKRTEEAYIEQLGPKFSILK 126 

Query: 179 -KNSHINSEILSVGNYTEWANQVIHHIENYVSFAAHLYSGLDPFHYIEVTDNSHVIGQTI 237 

KM + + + + + T +N ' + N + H + + + + SHV GQ + 

Sbjct: 127 QKNPDL I NKL VED S VLTPHSNTS QTS LNNI FNKKLH VKIENKSHVAGQVL 176 

Query: 23 8 SLDEFRLENSLWEPRWDSNVGKLKTT-NADIRFNTKSESLLVKEDYAGGARFRFAYDPKE 296 

L + L++SLWEPR S + + L+T+ NA IR NTK E L V + Y GGA F F YD +E 
Sbjct: 177 ELTKMTLKDSLWEPRRHSDIHTLETSDNARIRLNTKDEKLTVHKAYQGGADFLFGYDVRE 236 

Query: 297 AKNTALIFEKNVTGTSDIIFENPIDDLKSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTL 356 

+ AL FE+NV+G S++ + E ++LK+LDG ++I AD. ++F +++G+Y L 

Sbjct: 237 SDEPALTFEQNVSGKSNWLERRPENLKTLDGRKLIAAE-KADPNSFAFKQNYRQGLYEL 2 95 

Query: 357 SLQQRPEGF LPKCKNAMIWR- -FMHNRLKPPIPYCALRLNNKNSDIFDRTLP 4 06 

L+Q GF + P+ + + + + N L LR + + D++ 

Sbjct: 2 96 LLKQCEGGFCLGVQRLAIPEAEAVLYAQQAYAANTL FGLRAADRGGDVYAADPS 349 

Query: 407 RKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRS 466 

R+ LWLR I G S+Q ++G A +G RKGVQ+GGEVF QNE ++L+IG+MGG+A Q + 
Sbjct: 350 RQKLWLRF I GGRSHQN IRGGAA - ADGRRKGVQ I GGE VF VRQNEGSRLA I GVMGGRAGQHA 408 

Query: 467 TFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDGTERF 526 

+ + + G+G GVYA WHQL+DKQTGAY D W+QYQRF+HRIN E+ ER+ 

Sb j Ct : 4 0 9 SVNGKG- -GAAGSYLHGYGGGVYAAWHQLRDKQTGAYLDGWLQYQRFKHRINDENRVERY 466 

Query: 527 TSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENAHVNLLGS 586 

+KG TAS+E GYNAL+AE KGN++R YLQPQAQ TYLGVNG F+DSE V LLGS 
Sbjct: 4 67 KTKGWTASVEGGYNALVAEGWGKGNNVRFYLQPQAQFTYLGVNGGFTDSEGTAVGLLGS 52 6 
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Query: 587 RQLQTRVGVQAKAQFS L YKNI AI E PF AAVNAL YHNKPFGVEMDGERRVINNKTA I E S QLG 64 6 

Q Q+R G++AK +F+L + ++PFAA N L+ +K FGVEMDGE+4- + +TA+E + G 
Sbjct: 527 GQWQSRAGIRAKTRFALRNGVNLQPFAAFNVLHRSKSFGVEMDGEKQTLAGRTALEGRFG 586 

Query: 647 VAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQWTF 682 

+ K H+ + + ++T K+ AL+L+W F 

Sbjct: 587 IEAGWKGHMFARIGYGKRTDGD KEAALSLKWLF 619 



tr Q9JSR2 Putative virulence associated protein [vapA] 619 AA 

[Neisseria align 
meningitidis (serogroup A)] 



Score = 387 bits (993), Expect = e-106 

Identities = 223/576 (38%), Positives = 337/576 (57%), Gaps '= 46/576 (7%) 

Query: 13 0 DTEEQIRKYFKECFNSNTK IRDYSTCQAEKFGSHPLIVK-SHIFSLGPKI 17 8 

D ++KY + + N K ++D + E + + + ++I LGPK 

Sbjct: 67 DNMPWKKYITDTYGDNLKDAVKKQLQDLYKTRPEAWEENKKRTEEAYIEQLGPKFSILK 12 6 

Query: 17 9 -KNSHINSEILSVGNYTEWANQVIHHIENYVSFAAHLYSGLDPFHYIEVTDNSHVIGQTI 23 7 

KN 4- +++ + T +N + N + H +++ + SHV GQ + 

Sbjct: 127 QKNPDLINKLVEDSVLTPHSNTSQTSLNNIFNKKLH VKIENKSHVAGQVL 176 

Query: 238 S LDE FRLENS L WE PRWD SNVGKLKTT - NAD I RFNTKS E S LL VKED YAGGARFRFAYDPKE 2 96 

L + L++SLWEPR S + + L+T+ NA IR NTK E L V + Y GGA F F YD +E 
Sbjct: 177 ELTKMTLKDSLWEPRRHSDIHMLETSDNARIRLNTKDEKLTVHKAYQGGADFLFGYDVRE 236 

Query: 297 AKNTAL I FEKNVTGTSD I I FENP IDDLKSLDGHQI I KVNGTADKHAFRLSGKHQKGI YTL 356 

+ AL FE+ V+G S E ++LK+LDG + + I AD ++F + -f+G+Y L 

Sbjct: 237 SDKPALTFEEKVSGQSGWLERRPENLKTLDGRKL I AAE - KADSNS FAFKQNYRQGLYEL 295 

Query: 357 SLQQRPEGF LPKCKNAMI WR - - FMHNRLKPP I PYCALRLNNKNSD I FDRTLP 406 

L+Q GF + P+ + + + + N L LR + + D+ + 

Sbjct: 2 96 LLKQCEGGFCLGVQRLAIPEAEAVLYAQQAYAANTL FGLRAADRGDDVYAADPS 34 9 

Query: 407 RKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRS 466 

R+ LWLR I G S+Q ++G A +G RKGVQ+GGEVF QNE ++L+IG+MGG+A Q + 
Sbjct: 350 RQKLWLRF I GGRSHQNI RGGAA - ADGRRKGVQ I GGE VFVRQNEGS RL AI GVMGGRAGQHA 408 

Query: 4 67 TFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDGTERF 52 6 

+ + + G+G GVYA WHQL+DKQTGAY D W+QYQRF+HRIN E+ ER+ 

Sbjct: 409 SVNGKG - -GAAGS YLHGYGGGVYAAWHQLRDKQTGAYLDGWLQYQRFKHRINDENRAERY 466 

Query: 52 7 TS KG I TAS I E AG YNALLAEHFTKKGNS LRVYLQPQAQLT YLGVNGKFSDS ENAHVNLLGS 58 6 

+KG TAS+E GYNAL+AE KGN++R YLQPQAQ TYLGVNG F+DSE V LLGS 
Sbjct: 467 KTKGWTASVEGGYNALVAEGWGKGNNVRFYLQPQAQFTYLGVNGGFTDSEGTAVGLLGS 526 

Query: 5 87 RQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTAIESQLG 64 6 

Q Q+R G++AK +F+L + ++PFAA N L+ +K FGVEMDGE++ + +TA+E + G 
Sbjct: 527 GQWQSRAGIRAKTRFALRNGVNLQPFAAFNVLHRSKSFGVEMDGEKQTLAGRTALEGRFG 586 

Query: 647 VAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQWTF 682 

+ K H++- + + ++T K+ AL+L+W F 

Sbjct: 587 I E AGWKGHMS AR I GYGKRTDGD KEAALSLKWLF 619 
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tr Q6J5G0 Putative virulence -associated protein (Fragment) [Haemophilus 2 91 AA 
influenzae] 

align 

Score = 219 bits (557), Expect = 2e-55 

Identities = 136/291 (46%), Positives = 165/291 (55%), Gaps = 47/291 (16%) 

Query: 235 QT I S LD E FRLENS LWE PRWD SNVGKL KTTNAD I RFNTKS E S LL VKE 280 

Q I L E LENSLWEPRWDS+V L N+ IRFNTK+ESL+V E 
Sbjct : 1 QVIHLYELNLENSLWEPRWDSDVSYLSLYNSHIRFNTKNESLWGENRIRPTPDNALETE 60 

Query: 281 DYA- GGARFRFAYD - - PKEAKNTALI FEKNVTGTS 312 

DY G RFAYD +EA L + V G + 
Sbjct : 61 KDFKSRFSNIGYHSSSKIISYNNSRDYVLGYPSIRFAYDLSEREADKPVLTLKSKVRGKT 120 

Query: 313 DIIFENP-IDDLKSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKN 371 

I+FE +++LK+L Q+IK + AF L +++KG Y L LQQ P GF + 

Sbjct: 121 AIVFEEKALNNLKNLTYRQLIKTETDIEPDAFFLLEEYKKGRYRLFLQQCPNGFCIGVEK 180 

Query: 3 72 AMIWRFM HNRLKPP I PYCALRLNNKNSD I FDRTLPRKGLWLRVIDGHSNQWVQGKT 42 7 

I + 4- + ALRLN+KNSDIFDRTLPRKGLWLR+I GH +Q VQGKT 

Sbjct: 181 LAIPTHLVASYAQQAQAANTLFALRLNDKNSDIFDRTLPRKGLWLRLISGHLSQDVQGKT 240 

Query: 42 8 APVEGYRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTT 478 

APVEG" RKG+QLGG+VF+ QN+ Q S GLMGGQAEQRSTF NPDT N+TT 
Sbjct: 241 APVEGNRKGIQLGGDVFSLQNQDYQFSFGLMGGQAEQRSTFRNPDTGNVTT 291 



tr Q8X6C1 Putative beta-barrel outer membrane protein [z0402] 134 9 

[Escherichia coli AA 
0157 :H7] align 

Score = 142 bits (359) , Expect = 2e-32 

Identities = 100/327 (30%), Positives = 156/327 (47%), Gaps = 31/327 (9%) 

Query : 371 NAMIWRFMHNRLKPPIPYCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGK-TAP 429 

N M +H RL N +D+ + +W+R GH W+W G 

Sbjct: 1039 NTMFTTRLHERLG NTYYTDMVTGEQKQTTMWMRHEGGH - NKWRDGSGQLK 1087 

Query: 43 0 VEGYRKGVQLGGEVFTW-QNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGV 488 

+ R +QLGG+V W QN S++ +G+M G S + T +V G+ G+ 

Sbjct: 1088 TQSNRYVLQLGGDVAQWSQNGSDRWHVGVMAGYGNSDSKTISSRTGYRAKASVNGYSTGL 1147 

Query : 489 YATWHQLQDKQTGAYADSWMQYQRFRHRINTED-GTERFTSKGITASIEAGYNALLAEHF 547 

YATW+ + + GAY DSW QY F + + +D +E 4- SKG TAS+EAGY LAE 
Sbjct: 1148 YATWYADDESRNGAYLDSWAQYSWFDNTVKGDDLQSESYKSKGFTASLEAGYKHKLAEFN 1207 

Query: 54 8 TKKGNSLRVYLQPQAQLTYLGVNG - KFSDSENAHVNLLGSRQLQTRVGVQA- - KAQFSL - 603 

+G Y-fQPQAQ-f T++GV K +S V+ G +QTR+GV+ K+ + 

Sbj ct : 12 08 GSQGTRNEWYVQPQAQVTWMGVKADKHRESNGTLVHSNGDGNVQTRLGVKTWLKSHHKMD 1267 

Query: 604 - YKNI AIEPFAAVNALYHNKPFGVEM DGERRVINNKTAIESQLGVAVKIKSHL 655 

K+ 4-PF VN L-f ++K F M DG R + KT +E QL + + + + 
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Sbjct: 1268 DGKSREFQPFVEVNWLHNSKDFSTSMDGVSVTQDGARNIAEIKTGVEGQLNANLNVWGNV 1327 

Query: 656 TLQATFNRQTGKHHQAKQGALNLQWTF 682 

+Q + + + + +W F 

Sbjct: 1328 GVQV ADRGYNDTSAMVGIKWQF 1349 



tr Q7AH8 9 AidA-I adhesin-like protein [ECs0362] [Escherichia coli 1327 
0157 :H7] AA 

align 

Score = 142. bits (359), Expect = 2e-32 

Identities = 100/327 (30%), Positives = 156/327 (47%), Gaps = 31/327 (9%) 

Query: 371 NAMIWRFMHNRLKPPIPYCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGK-TAP 42 9 

N M +H RL N +D+ + +W+R GH N+W G 

Sbjct: 1017 NTMFTTRLHERLG NTYYTDMVTGEQKQTTMWMRHEGGH-NKWRDGSGQLK 1065 

Query : 43 0 VEGYRKGVQLGGEVFTW-QNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGV 4 88 

+ R +QLGG+V W QN S++ +G+M G S + T +V G+ G+ 

Sbjct: 1066 TQSNRYVLQLGGDVAQWSQNGSDRWHVGVMAGYGNSDSKTISSRTGYRAKASVNGYSTGL 1125 

Query: 489 YATWHQLQDKQTGAYADSWMQYQRFRHRINTED-GTERFTSKGITASIEAGYNALLAEHF 547 

YATW + + + GAY DSW QY F + 4- +D +E + SKG TAS+EAGY LAE 
Sbjct: 112 6 YATWYADDESRNGAYLDSWAQYSWFDNTVKGDDLQSESYKSKGFTASLEAGYKHKLAEFN 1185 

Query: 548 TKKGNSLRVYLQPQAQLTYLGVNG-KFSDSENAHVNLLGSRQLQTRVGVQA- -KAQFSL- 603 

+G Y+QPQAQ+T++GV K +S V+ G +QTR+GV+ K+ + 

Sbjct: 1186 GSQGTRNEWYVQPQAQVTWMGVKADKHRESNGTLWSNGDGNVQTRLGVKTWLKSHHKiyiD 1245 

Query: 604 -YKNIAIEPFAAVNALYHNKPFGVEM DGERRVINNKTAIESQLGVAVKIKSHL 655 

K+ + PF VN L+4- + K F M DG R + KT +E QL + + ++ 

Sbjct: 1246 DGKSREFQPFVEWWLHNSKDFSTSMDGVSVTQDGARNIAEIKTGVEGQLNANLNVWGNV 13 05 

Query: 65 6 TLQATFNRQTGKHHQAKQGALNLQWTF 682 

+Q + + + ++W F 

Sbjct: 1306 GVQV ADRGYNDTSAMVGIKWQF 1327 



tr Q6KCW6 YapH homo log [yapH] [Escherichia coli] 2 001 AA 

align 

Score = 139 bits (351) , Expect = le-31 

Identities = 87/294 (29%), Positives = 145/294 (48%), Gaps = 10/294 (3%) 

Query: 3 98 SDIFDRTLPRKGLWLRVIDGHSNQWVQGKTA-PVEGYRKGVQLGGEVFTWQNESNQLSIG 456 

+D+F +W+R I GH N+W + + R VQLGG + W + ++L G 

Sbjct: 1709 TD VFTGE KKATS MWMRH I GGH - NRWKDS S S QLNTQSNR Y WQLGGS I AQWTDGQDRLQQG 1767 

Query: 457 LMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHR 516 

+M G ++S + + + + G + G+ G+Y TW Q GAY D+W+QY F + 

Sbjct: 1768 IMAGYGNEKSSTTSSLSGYKSKGAINGYSTGLYGTWQQNDGNDNGAYVDTWIQYGWFNNT 1827 

Query : 517 INTED-GTERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVN-GKFS 574 

+N E E + S+G T S+EAGY E +G+ Y+ QPQ+Q+T++ V + + 
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Sbjct: 1828 WGEKLAAESWKSRGFTGSVEAGYTFKAGEFTGSQGSHYDWYIQPQSQITWMNVRASEHT 1887 

Query: 575 DSENAHVNLLGSRQLQTRVGV QAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDG 63 0 

+ V L G +Q+R+GV + K-f K EPF VN + + + -f- +GV+MD 

Sbjct: 1888 EKNGTKVQLSGDGNIQSRLGVRTYLKGKSASDDNKAHQFEPFVEVNWIHNTRSWGVKMDN 1947 

Query : 631 ERRVINNKTAI - E S QLGVAVKI KSHLTLQATFNRQTG - KHHQAKQGALNLQWTF 682 

- + T I E + GV K+ .+L + QGK+ QL + + + F 

Sbjct: 1948 TALSQDGATNIAEVKTGVQGKLSDNLNVWGMVGVQAGDKGYSDAQAMLGIKYIF 2001 



tr Q8FFF9 Hypothetical protein ydeU [ydeU] [Escherichia coli 291 AA 

06] align 

Score = 137 bits (344) , Expect = 9e-31 

Identities = 85/282 (30%) , Positives = 141/282 (49%) , Gaps = 10/282 (3%) 

Query: 410 LWLRVIDGHSNQWVQGKTA-PVEGYRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTF 468 

+W+R I GH N+W + + R VQLGG + W + ++L G+M G ++S + 
Sbjct : 11 MWMRH I GGH - NRWKDS S S QLNTQ SNR Y WQLGGS I AQWTDGQDRLQQG I MAG YGNE KS S T 69 

Query: 469 HNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTED - GTERFT 527 

+ + + G + G-f G+Y TW Q GAY D+W+QY F + +N E E + 

Sb j C t : 70 TSSLSGYKSKGAINGYSTGLYGTWQQNDGNDNGAYVDTWIQYGWFNNTVNGEKLAAESWK 12 9 

Query: 528 SKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVN-GKFSDSENAHVNLLGS 586 

S+G T S+EAGY E +G+ Y-+-QPQ+Q+T++ V + V L G 

Sbjct: 130 SRGFTGSVEAGYTFKAGEFTGSQGSHYDWYIQPQSQITWMNVRASEHTEKNGTKVQLSGD 189 

Query: 587 RQLQTRVGV QAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTAI- 641 

+Q+R+GV + K+ K EPF VN +++ + +GV+MD + T I 

Sbjct: 190 GNIQSRLGWTYLKGKSASDDNKAHQFEPFVEWWIHNTRSWGVKMDNTALSQDGATNIA 249 

Query: 642 ESQLGVAVKIKSHLTLQATFNRQTG- KHHQAKQGALNLQWTF 682 

E + GV K-f +L + QGK+ QL +++ F 

Sbjct: 2 50 EVKTGVQGKLSDNLNVWGNVGVQAGDKGYSDAQAMLGIKYIF 2 91 



tr Q7BCK4 VirG [virG] [Shigella flexneri] 1102 AA 

align 

Score = 134 bits (338), Expect = 4e-30 

Identities = 88/297 (29%), Positives = 149/297 (49%), Gaps = 12/297 (4%) 

Query-: 398 SD I FDRT LPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVF - TWQNESNQL 453 

S I D T L +W+R + GH+ + + R Q+GG++ T + + L 

Sbjct : 806 S T I VDPTTGQL S ETTMW I RT VGGHNEHNL ADRQLKTTANRMVYQ I GGD I LKTNFTDHDGL 865 

Query : 454 S IGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRF 513 

+G+MG Q S HN T + G V G+ AG+Y++W Q + ++TG Y D+W+QY F 
Sbjct : 866 HVGIMGAYGYQDSKTHNKYTSYSSRGTVSGYTAGLYSSWFQDEKERTGLYMDAWLQYSWF 925 

Query: 514 RHRINTEDGT-ERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGK 572 

+ + + T E+++SKGIT ++EAGY +YL PQ Q+T GV 

Sbjct : 92 6 NNTVKGDGLTGE KYS S KG I TGALE AG Y I Y PT I RWTAHNN I DNAL YLNPQ VQ I TRHGVKAN 985 
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Query : 573 -FSDSENAHWLLGSRQLQTRVGVQAK- - AQFSLYKNI - - AIEPFAAVNALYHNKPFGVE 627 

+ + V G +Q ++G++ +Q + K EPF VN + +K +GV 

Sbjct : 986 DYI EHNGTMVTS SGGNNIQAKLGLRTSL I SQSC ID KETLRKFEPFLEVNWKWS S KQYGVI 1045 

Query: 628 MDG-ERRVINNKTAIESQLGVAVKIKSHLTLQATFNRQTGKH-HQAKQGALNLQWTF 682 

M+G I N+ IE + GV + + +L++ ++Q G + ++ QG L +++TF 

Sbjct: 1046 MNGMSNHQ I GNRNVI ELKTG VGGRL ADNL S I WGNVS QQLGNNS YRDTQG I LGVKYTF 1102 



tr Q99Q93 IcsA (VirG) , outermembrane protein exposed to the bacterial 
surface 

by a C- terminal autotransporter domain and involved in 
the movement of intracellular bacteria by binding to 
N-WASP (Invasion protein) [icsA (virG) ] [Shigella 
f lexneri] 

Score = 134 bits (338) , Expect = 4e-30 
Identities = 88/297 (29%) , Positives = 149/297 (49%) , Gaps = 12/297 (4%) 

Query: 398 SDI FDRT LPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVF - TWQNESNQL 453 

S I D T L +W+R + GH+ + + R Q+GG++ T + + L 

Sbjct : 806 STIVDPTTGQLSETTMWIRTVGGHNEHNLADRQLKTTANRMVYQIGGDILKTNFTDHDGL 865 

Query: 4 54 SI GLMGGQAEQRS TFHNPDTDNLTTGNVKGFGAG VYATWHQLQDKQTGAYADS WMQYQRF 513 

+G+MG Q S HN T + G V G+ AG+Y++W Q + ++TG Y D+W+QY F 
Sbjct: 866 HVGIMGAYGYQDSKTHNKYTSYSSRGTVSGYTAGLYSSWFQDEKERTGLYMDAWLQYSWF 925 

Query: 514 RHRINTEDGT-ERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGK 5 72 

+ + + T E+++SKGIT ++EAGY +YL PQ Q+T GV 

Sbjct: 926 NNTVKGDGLTGE KYS S KG I TGALE AG Y I Y P T I RWTAHNN I DNAL YLNP QVQ I TRHGVKAN 985 

Query : 5 7 3 - FSDSENAHWLLGSRQLQTRVGVQAK- - AQFSLYKNI - - AIEPFAAVNALYHNKPFGVE 62 7 

+ + V G +Q 4-+G4-+ +Q + K EPF VN + +K . +GV 

Sbjct: 986 DYI EHNGTMVTS SGGNNIQAKLGLRTSL I SQSCIDKETLRKFEPFLEVNWKWSSKQYGV I 1045 

Query : 62 8 MDG-ERRVINNKTAIESQLGVAVKIKSHLTLQATFNRQTGKH-HQAKQGALNLQWTF 682 

M+G I N+ IE + GV ++ 4-L++ ++Q G + ++ QG L + ++TF 

Sbjct: 1046 MNGMSNHQ I GNRNVI ELKTGVGGRL ADNL S I WGNVS QQLGNNS YRDTQG I LGVKYTF 1102 



1102 
AA 

align 



tr Q52298 Virulence-associated VirG [Plasmid pMYSH6000] 1102 AA 

align 

Score = 134 bits (338) , Expect = 4e-30 

Identities = 88/297 (29%) , Positives = 149/297 (49%) , Gaps = 12/297 (4%) 

Query: 398 SD I FDRT- - -LPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVF- TWQNESNQL 453 

S I D T L +W+R + GH+ + + R Q+GG+4- T + + L 

Sbjct: 806 ST I VDPTTGQLS ETTMW I RTVGGHNEHNL ADRQLKTTANRMVYQ I GGD I LKTNFTDHDGL 865 

Query: 454 S I GLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADS WMQYQRF 513 

+G+MG Q. S HN T + G V G+ AG+Y++W Q + ++TG Y D+W+QY F 
Sbjct: 866 HVGIMGAYGYQDSKTHNKYTSYSSRGTVSGYTAGLYSSWFQDEKERTGLYMDAWLQYSWF 925 
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Query: 514 RHRINTEDGT-ERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGK 572 

+ + + t E+++SKGIT ++EAGY +YL PQ Q+T GV 

Sbjct : 926 NNTVKGDGLTGEKYS S KG I TGALE AG Y I YPT I RWTAHNN IDNAL YLNPQ VQ I TRHGVKAN 985 

Query: 573 -FSDSENAHVNLLGSRQLQTRVGVQAK- -AQFSLYKNI - -AIEPFAAVNALYHNKPFGVE 627 

+ + V G +Q ++G+ + +Q + K EPF VN + +K +GV 

Sbjct : 986 DYIEHNGTMVTSSGGNNIQAKLGLRTSLISQSCIDKETLRKFEPFLEVNWKWSSKQYGVI 1045 

Query: 62 8 MDG-ERRVINNKTAIESQLGVAVKIKSHLTLQATFNRQTGKH-HQAKQGALNLQWTF 682 

M+G I N+ IE + GV ++ +L+ + ++Q G + ++ QG L + -+-+TF 

Sbjct: 1046 MNGMSNHQIGNRNVIELKTGVGGRLADNLSIWGNVSQQLGNNSYRDTQGILGVKYTF 1102 



tr Q9JMS3 YchA protein [ychA] [Escherichia coli] 1371 AA 

align 

Score = 134 bits (337) , Expect = 6e-30 

Identities = 122/461 (26%), Positives = 193/461 (41%), Gaps = 46/461 (9%) 

Query: 266 D I RFNTKS E S LL VKED YAGG ARFRFAYDPKEAKNTAL I FEKNVTGTSD 1 1 FENP IDD 322 

D+R L + +Y G D + L-f- + + +G + + N 

Sbjct: 913 DLR S AAPGN I LT I GGN YTGNNGTLL I NT VLDD S S S ATDKL VI KGD AS GKTRVAVTNVGG S 972 

Query: 323 -LKSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQRPEG FLPKCKNAMIWR 376 

+L+ ++I V+G A F +G+ G Y +L + P +L KN R 

Sbjct : 973 GANTLNS IE VIHVDGNAANAEF I QAGRI AAGAYDYTLGRGPGSNYGNWYLS S S KNTPEPR 1032 

Query: 377 FMHNRLKPP I P YCALRLNNKNSD I FDRTLPRKG LWL 412 

N L+P -h N+ R R G . +W+ 

Sbjct: 1033 PDPEPTPEGHDNNLRPEASSYTANIAAANTMFVTRLHERLGQTQYVDAITGEPKATSMWM 1092 

Query: 413 RVIDGHSNQWVQGK-TAPVEGYRKGVQLGGEVFTWQ-NESNQLSIGLMGGQAEQRSTFHN 470 

R GH N4-W G + R +QLGG++ W +N+ +G+M G S-h 

Sbjct: 10 93 RHEGGH-NRWRDGSGQLKTQSNRYVIQLGGDIAQWDWGGTNRWHLGVMAGYGNNHSSTGA 1151 

Query: 471 PDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDGT- -ERFTS 528 

T + G+V G+ G+YATW+ + GAY D+W QY F + + DG E + S 
Sbjct: 1152 VRTGYHSKGSVNGYSTGLYATWYADDETHNGAYLDTWAQYGWFDNHVK-GDGLPGESWKS 1210 

Query: 52 9 KGITAS IEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNG- KFSDSENAHVNLLGSR 5 87 

KG+TAS+E GY + E + GN Y+QPQAQL ++GV + +S + G 

Sbjct: 1211 KGLTASLETGYAWKIGEFSSNYGNLNEWYVQPQAQLVWMGVKADELYESNGTLIESTGDG 1270 

Query: 588 QLQTRVGVQA KAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTAI -E 642 

+ TR+GV+ + K+ PF VN L++ + FGV M+GE + IE 

Sbjct: 1271 NWTRLGVKTWIKRLNKMDDGKSREFSPFVEVNWLHNTRDFGVRMNGEPVYQDGTRNIGE 1330 

Query: 643 SQLGVAVKIKSHLTLQATFNRQTG - KHHQAKQGALNLQWTF 682 

+ GV +1 HL L Q G K + L +++TF 

Sbjct: 13 31 VKTGVEGQINPHLNLWGNVRVQVGDKGYNDTSAMLGVKYTF 13 71 



tr Q6WE2 7 IcsA [icsA] [Shigella f lexneri] 1102 AA 

align 
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Score = 134 bits (336) , Expect = 7e-30 

Identities = 90/301 (29%), Positives = 150/301 (48%), Gaps = 20/301 (6%) 

Query: 3 98 SDIFDRT LPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVF-TWQNESNQL 453 

S IDT L +W+R + GH+ + + R Q+GG++ T + + L 

Sbjct: 806 ST I VD PTTGQL S ETTMW I RTVGGHNEHNL ADRQLKTTANRMVYQ I GGD I LKTNFTDHDGL 865 

Query: 4 54 SIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRF 513 

+G+MG Q S HN T + G V G+ AG+Y++W Q + ++TG Y D+W+QY F 
Sbjct : 866 HVGIMGAYGYQDSKTHNKYTSYSSRGTVSGYTAGLYSSWFQDEKERTGLYMDAWLQYGWF 925 

Query: 514 RHRINTEDGT-ERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGK 572 

+ + + T E+++SKGIT ++EAGY T +YL PQ Q+T GV 

Sbj Ct : 926 NNTVKGDGLTGE KYS S KG I TGALE AG Y I Y PT I RWTTHNN I DNAL YLNP QVQ I TRHGVKAN 985 

Query: 573 -FSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAI E P FAAVNAL YHNKP 623 

+ 4- V G +Q ++G+ + SL + I EPF VN + + K 

Sbjct : 986 DYIEHNGTMVTSSGVNNIQAKLGLRT SLISHSCIDKETLRKFEPFLEVNWKWSSKQ 1041 

Query: 624 FGVEMDG - ERRVINNKTAIESQLGVAVKIKSHLTLQATFNRQTGKH -HQAKQGALNLQWT 681 

+GV M+G I N+ IE + GV ++Q G + ++ QG L +++T 

Sbjct: 1042 YGV I MNGMSNHQ I GNRNV I EL KTGVGGRL ADNL S I WGNVS QQLGNNS YRDTQG I LGVKYT 1101 

Query: 682 F 682 

F 

Sbjct: 1102 F 1102 



tr Q8ZHA1 Putative autotransporter protein [yapH] [Yersinia 3705 AA 

pestis] align 

Score = 130 bits (326) , Expect = le-28 

Identities = 106/385 (27%), Positives = 170/385 (43%), Gaps = 36/385 (9%) 

Query : 324 KSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKNAMIWRFMHNRLK 3 83 

+ + +G +11 V G + F L+G+ GYLQ PN+ +R 

Sbjct: 3331 RTFEGIKIIDVGGDSAGQ-FTLNGRAVGGAYEYFLYQGGAS - TPDDGNWYLRTEADDRRP 3388 

Query: 384 PPIPYCALRLNNKN SDI FDRTLPRKGLWLRVIDGHSNQWVQGKTA 428 

P Y A N +D+F LWLR H+ 

Sbjct: 3389 EPASYTANLAAANNMFVTSLADRMGETLYTDVFTGEQKTTSLWLRNEGSHNRSRDDSGEL 3448 

Query: 42 9 PVEGYRKGVQLGGEVFTWQNESNQL-SIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAG 4 87 

+ R +QLGG+V W + L +G+M G A S+ +TG+V G+ G 

Sbjct: 3449 KTQDNRYViyiQLGGDVAQWSRNAQDLWRVGWIAGYANSSSSTVAQVAGYRSTGSVDGYSVG 3508 

Query: 488 VYATWHQLQDKQTGAYADSWMQYQRFRHRINTED-GTERFTSKGITASIEAGYNALLAEH 546 

+Y +W TGAY DSW+QY F +R++ +D TE++ SKG TAS+E GY + E 

Sbjct: 3509 IYGSWLADNADDTGAYVDSWVQYSWFDNRVSGQDLATEKYDSKGFTASVEGGYAFKVGES 3568 

Query: 547 FTKKGNSLRVYLQPQAQLTYLGVNG-KFSDSENAHVNLLGSRQLQTRVGVQA KA 599 

+ ++QP+AQ+ ++GV +++ ++ G+ +QTR+G +A KA 

Sbjct: 3569 VNQS YF IQPKAQVVWMGVKADDHTETNGTVI SGDGNGNI QTRLGAKAF INPSDKA 3623 

Query : 60 0 QFSLYKNIAIEPFAAVNALYHNKPFGVEMDGER-RVINNKTAIESQLGVAVKIKSHLTLQ 658 

4- S A + PF N +++ K FG 4-DG + E +LGV ++ S L L 
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Sbjct: 3624 KVS GPAFKPFVEANWIHNTKDFGTTLDGVTVKQAGTANIAELKLGVDGQVNSQLNLW 3680 

Query: 659 ATFNRQTG - KHHQAKQGALNLQWTF 682 

+Q G K + L + + + F 

Sbjct: 3 681 GN I GQQ VGNKG YS ETS WLGVKYNF 37 05 



tr Q8CZU2 Putative autotransporter adhesin [yapH] [Yersinia 3710 AA 

pestis] ajyign 

Score = 130 bits (326) , Expect = le-28 

Identities = 106/385 (27%), Positives = 170/385 (43%), Gaps = 36/385 (9%) 

Query: 324 KSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKNAMIWRFMHNRLK 3 83 

+ + +G +11 V G + F L+G-f GYLQ PN+ +R 

Sbjct: 3336 RTFEGIKI IDVGGDSAGQ- FTLNGRAVGGAYEYFLYQGGAS -TPDDGNWYLRTEADDRRP 3393 

Query: 3 84 PP I P YCALRLNNKN SDIFDRTLPRKGLWLRVIDGHSNQWVQGKTA 42 8 

p Y A N +D+F LWLR H+ 

Sbjct: 3394 EPASYTANLAAANNMFVTSLADRMGETLYTDVFTGEQKTTSLWLRNEGSHNRSRDDSGEL 3453 

Query: 42 9 PVEGYRKGVQLGGEVFTWQNESNQL - S IGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAG 4 87 

+ R +QLGG+V W + L +G+M G A S + +TG+V G-f G 

Sbjct: 3454 KTQDNRYWQLGGDVAQWSRNAQDLWRVGVKAGYANSSSSTVAQVAGYRSTGSVDGYSVG 3513 

Query : 4 8 8 VYATWHQLQDKQTGAYADSWMQYQRFRHRINTED-GTERFTSKGITASIEAGYNALLAEH 54 6 

+Y +W TGAY DSW+QY F +R++ +D TE+ + SKG TAS+E GY + E 

Sbjct: 3514 IYGSWLADNADDTGAYVDSWVQYSWFDNRVSGQDLATEKYDSKGFTASVEGGYAFKVGES 3573 

Query: 547 FTKKGNSLRVYLQPQAQLTYLGVNG - KFSDSENAHVNLLGSRQLQTRVGVQA KA 599 

+ ++ Qp + AQ+ ++GV + + + + + G-f +QTR+G +A KA 

Sbjct: 3574 VNQS YFIQPKAQWWMGVKADDHTETNGTVISGDGNGNIQTRLGAKAFINPSDKA 3628 

Query: 600 QFSLYKNIAIEPFAAVNALYHNKPFGVEMDGER-RVINNKTAIESQLGVAVKIKSHLTLQ 658 

+ S A +PF N + + + K FG +DG + E +LGV ++ S L L 

Sbjct: 3 62 9 KVS GPAFKPFVEANWIHNTKDFGTTLDGVTVKQAGTANIAELKLGVDGQVNSQLNLW 3 685 

Query: 659 ATFNRQTG -KHHQAKQGALNLQWTF 682 

+Q G K + L +++ F 

Sbjct: 3686 GN I GQQVGNKGYS ETS WLGVKYNF 3710 



tr Q9F289 YapD protein [yapD] [Yersinia pestis] 1457 AA 

align 

Score = 130 bits (326) , Expect = le-28 

Identities = 106/385 (27%) , Positives = 170/385 (43%) , Gaps = 36/385 (9%) 

Query : 324 KSLDGHQI I KVNGTADKHAFRLSGKHQKGI YTLSLQQRPEGFLPKCKNAMIWRFMHNRLK 3 83 

++ +G +11 V G + F L+G+ GYLQ P N + +R 

Sbjct: 1083 RTFEGIKI IDVGGDSAGQ- FTLNGRAVGGAYEYFLYQGGAS -TPDDGNWYLRTEADDRRP 1140 

Query: 384 PP IP YCALRLNNKN SDIFDRTLPRKGLWLRVIDGHSNQWVQGKTA 428 

P Y A N +D+F LWLR H+ 

Sbjct: 1141 EPASYTANLAAANNMFVTSLADRMGETLYTDVFTGEQKTTSLWLRNEGSHNRSRDDSGEL 1200 
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Query: 42 9 PVEGYRKGVQLGGEVFTWQNESNQL - SIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAG 4 87 

+ R +QLGG+V W + L +G+M G A S+ 4-TG+V G+ G 

Sbjct: 1201 KTQDNRYVMQLGGDVAQWSRNAQDLWRVGVMAGYANSSSSTVAQVAGYRSTGSVDGYSVG 1260 

Query : 4 88 VYATWHQLQDKQTGAYADS WMQ YQRFRHR INTED - GTERFTS KG I TAS I E AGYNALL AEH 54 6 

+Y +W TGAY DSW+QY F 4-R++ 4-D TE++ SKG TAS+E GY + E 

Sbjct: 1261 I YG S WL ADNADDTGAY VD S W VQ Y S W FDNR VS GQDL ATE K YD S KG F T AS VE GG Y AF KVGE S 1320 

Query: 547 FTKKGNSLRVYLQPQAQLTYLGVNG - KFSDSENAHVNLLGSRQLQTRVGVQA KA 599 

4- ++QP+AQ+ ++GV 4-4-4- ++ G+ +QTR+G +A KA 

Sbjct: 1321 VNQS YFIQPKAQVWMGVKADDHTETNGTVISGDGNGNIQTRLGAKAFINPSDKA 1375 

Query : 600 QFSLYKNIAIEPFAAVNALYHNKPFGVEMDGER-RVINNKTAIESQLGVAVKIKSHLTLQ 658 

+ S A +PF N 4-4-4- K FG +DG 4- E 4-LGV 4-4- S L L 

Sbjct: 1376 KVS GPAFKPFVEANWIHNTKDFGTTLDGVTVKQAGTANIAELKLGVDGQVNSQLNLW 1432 

Query: 659 ATFNRQTG - KHHQAKQGALNLQWTF 682 

+Q G K 4- L 4-4-4- F 

Sbjct: 1433 GNIGQQVGNKGYSETSWLGVKYNF 1457 



tr Q 9F285 YapH protein [yapH] [Yersinia pestis] 3705 AA 

align 

Score = 130 bits (326) , Expect = le-28 

Identities = 106/385 (27%), Positives = 170/385 (43%), Gaps = 36/385 (9%) 

Query: 324 KSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKNAMIWRFMHNRLK 3 83 

4-4 +G +11 V G 4- F L4-G+ GYLQ P N 4- +R 

Sbjct: 3331 RTFEG I KI ID VGGD S AGQ - FTLNGRAVGGAYE YFL YQGGAS - TPDDGNW YLRTE ADDRRP 3388 

Query: 384 PP I P YCALRLNNKN SDIFDRTLPRKGLWLRVIDGHSNQWVQGKTA 428 

P Y A N +D4-F LWLR H+ 

Sbjct: 3389 EPASYTANLAAANNMFVTSLADRMGETLYTDVFTGEQKTTSLWLRNEGSHNRSRDDSGEL 3448 

Query: 42 9 PVEGYRKGVQLGGEVFTWQNESNQL -SIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAG 4 87 

4- R 4-QLGG4-V W 4- L +G+M G A S+ 4-TG4-V G+ G 

Sbjct: 3449 KTQDNRYVT^QLGGDVAQWSRNAQDLWRVGVMAGYANSSSSTVAQVAGYRSTGSVDGYSVG 3508 

Query : 488 VYATWHQLQDKQTGAYADS WMQ YQRFRHR INTED -GTERFTS KG I TAS IE AG YNALLAEH 546 

+Y +W TGAY DSW4-QY F +R+4- 4-D TE4-+ SKG TAS4-E GY + E 

Sb j c t : 35 09 I YGSWLADNADDTGAYVDSWVQYSWFDNRVSGQDLATEKYDSKGFTASVEGGYAFKVGES 3 56 8 

Query: 547 FTKKGNSLRVYLQPQAQLTYLGVNG- KFSDSENAHVNLLGSRQLQTRVGVQA KA 599 

+ 4-4-QP4-AQ4- 4-4-GV 4-4-4- 4-+ G+ 4-QTR+G +A KA 

Sbjct: 3569 VNQS YFIQPKAQWWMGVKADDHTETNGTVISGDGNGNIQTRLGAKAFINPSDKA 3623 

Query: 60 0 QFSLYKNIAIEPFAAVNALYHNKPFGVEMDGER-RVINNKTAIESQLGVAVKIKSHLTLQ 65 8 

+ S A 4-PF N +++ K FG 4-DG 4- E 4-LGV 4-4- S L L 

Sbjct: 3624 KVS GPAFKPFVEANWIHNTKDFGTTLDGVTVKQAGTANIAELKLGVDGQVNSQLNLW 3680 

Query: 659 ATFNRQTG -KHHQAKQGALNLQWTF 682 

4-Q G K 4- L 4-4-4- F 

Sbjct: 3681 GNIGQQVGNKGYSETSWLGVKYNF 3705 
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tr Q74QP7 Putative autotransporter protein [yapH] [Yersinia 3710 AA 

pestis] align 

Score = 130 bits (326) , Expect = le-28 

Identities = 106/385 (27%), Positives = 170/385 (43%), Gaps = 36/385 (9%) 

Query: 324 KSLDGHQIIKWGTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKNAMIWRFMHNRLK 3 83 

++ +G +11 V G + F L+G + GYLQ PN+ +R 

Sbjct: 3336 RTFEGIKIIDVGGDSAGQ-FTLNGRAVGGAYEYFLYQGGAS-TPDDGNWYLRTEADDRRP 3393 

Query: 384 PPIPYCALRLNNKN SDIFDRTLPRKGLWLRVIDGHSNQWVQGKTA 428 

P Y A N +D+F LWLR H+ 

Sbjct: 3394 EPASYTANLAAANNMFVTSLADRMGETLYTDVFTGEQKTTSLWLRNEGSHNRSRDDSGEL 3453 

Query: 42 9 PVEGYRKGVQLGGEVFTWQNESNQL - SIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAG 4 87 

+ R +QLGG+V W + L +G+M G A S+ +TG+V G+ G 

Sbjct: 3454 KTQDNRYVMQLGGDVAQWSRNAQDLWRVGVMAGYANSSSSTVAQVAGYRSTGSVDGYSVG 3513 

Query : 488 VYATWHQLQDKQTGAYADSWMQYQRFRHRINTED-GTERFTSKGITASIEAGYNALLAEH 546 

+Y +W TGAY DSW+QY F +R+ + +D TE++ SKG TAS+E GY + E 

Sbjct: 3514 IYGSWLADNADDTGAYVDSWVQYSWFDNRVSGQDLATEKYDSKGFTASVEGGYAFKVGES 3573 

Query : 547 FTKKGNSLRVYLQPQAQLTYLGVNG-KFSDSENAHVNLLGSRQLQTRVGVQA KA 599 

+ ++ Qp + AQ-f ++GV +++ ++ G+ +QTR+G +A KA 

Sbjct: 3574 VNQS YFIQPKAQWWMGVKADDHTETNGTVISGDGNGNIQTRLGAKAFINPSDKA 3628 

Query : 600 QFSLYKNIAIEPFAAVNALYHNKPFGVEMDGER-RVINNKTAIESQLGVAVKIKSHLTLQ 658 

+ S A +PF N +++ K FG +DG + E +LGV ++ S L L 

Sbjct: 3629 KVS GPAFKPFVEANWIHNTKDFGTTLDGVTVKQAGTANIAELKLGVDGQVNSQLNLW 3685 

Query: 659 ATFNRQTG - KHHQAKQGALNLQWTF 682 

+Q G K + L +++ F 

Sbjct: 3686 GN I GQQVGNKG YS ET S WLGVKYNF 3710 



tr Q9F286 YapG protein (Putative ATP -binding transport component) 994 
(Putative AA 
autotransporter protein) [yapG] [Yersinia pestis] align 

Score = 129 bits (324), Expect = 2e-28 

Identities = 84/272 (30%), Positives = 132/272 (47%), Gaps = 12/272 (4%) 

Query: 410 LWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQ -NESNQLS IGLMGGQAEQRSTF 468 

+W+ G S + ++G VQLGG++ W NE IG++ G + + 

Sbjct: 723 MWMHTQGGRSQFGHTVEQLNIKGNYYSVQLGGDIAQWATNEQGSGRIGMLAGLGKATNHS 782 

Query: 4 69 HNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTED - GTERFT 52 7 

H+ T + G V G+ G+YATW Q TG Y D+ QY F + +N +D E++ 
Sbjct: 783 HSKVTSYHSRGAVDGYNLGIYATWFADQQHNTGVYIDTLAQYSWFNNAVNGQDKAEEKYK 842 

Query: 528 SKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGK-FSDSENAHVNLLGS 586 

S G T SIE+GY LA + L ++QP AQ+T+ G+N + + 4- A V+ 4- 

Sbjct: 843 SSGFTTSIESGYTFNLA NSDQLSYFIQPNAQITWAGINAQTHKTADGAWSYRNN 897 

Query: 587 RQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNK- -TAIESQ 644 
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TR+G +A Q N PF AVN + + N + G + G+ I+NK + E 

Sbjct: 898 GHFITRIGAKAYLQTHDTLNTKFTPFVAVNWIHQNQNTGTTISGQG- - IDNKIQMSTEFN 955 

Query: 645 LGVAVKIKSHLTLQATFNRQTGKHHQAKQGAL 676 

+GV +1 L + A N Q G+ + + AL 
Sbjct: 956 VGVESQIDQQLHIWANINHQIGRYNYTDTNAL 987 



tr Q74RV9 Putative autotransporter protein [YP2907] [Yersinia pestis] 994 AA 

align 

Score = 129 bits (324) , Expect = 2e-28 

Identities = 84/272 (30%), Positives = 132/272 (47%), Gaps = 12/272 (4%) 

Query: 410 LWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQ-NESNQLSIGLMGGQAEQRSTF 468 

+W+ G S + ++G VQLGG++ W NE IG+ + G 4- + 

Sbjct: 72 3 MWMHTQGGRSQFGHTVEQLNIKGNYYS VQLGGDIAQWATNEQGSGRIGMLAGLGKATNHS 782 

Query: 469 HNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTED - GTERFT 527 

H+ T + G V G+ G+YATW Q TG Y D+ QY F + +N +D E + + 
Sbjct: 783 HSKVTSYHSRGAVT)GYNLGIYATWFADQQHNTGVYIDTLAQYSWFNNAVNGQDKAEEKYK 842 

Query: 52 8 SKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGK-FSDSENAHVNLLGS 586 

S G T SIE+GY LA + L ++QP AQ+T+ G+N + ++ A V+ + 

Sbjct: 843 S SGFTTS I ESGYTFNLA NSDQLSYFIQPNAQITWAGINAQTHKTADGAWSYRNN 897 

Query: 587 RQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINWK- -TAIESQ 644 

TR+G +A Q M PF AVN ++ N+ G + G+ I+NK + E 

Sbjct: 898 GHFITRIGAKAYLQTHDTLNTKFTPFVAVNWIHQNQNTGTTISGQG- - IDNKIQNSTEFN 955 

Query: 645 LGVAVKIKSHLTLQATFNRQTGKHHQAKQGAL 676 

+GV +1 L + A N Q G+++ AL 
Sbjct: 956 VG VE S Q I D QQLH I WAN I NHQ I GR YN YTD TNAL 987 



sp P77286 Hypothetical protein ydeU [ydeU] [Escherichia 466 AA 

YDEU_ECOLI coli] align 

Score = 129 bits (323), Expect = 2e-28 

Identities = 87/323 (26%), Positives = 155/323 (47%), Gaps = 23/323 (7%) 

Query: 371 NAMIWRFMHNRLKPPIPYCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPV 430 

N++ +H+RL P + +D +W+R + GH 

Sbjct: 156 NSLFSHRLHDRLGEP QYTDSLHSQGSASSMWMRHVGGHERSRAGDGQLNT 205 

Query: 431 EGYRKGVQLGGEVFTWQ -NESNQLS IGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVY 4 89 

+ R +QLGG++ W N ++ +G+M GAQS + +G+G+ AG+Y 

Sbjct: 206 QANRYVLQLGGDLAQWSSNAQDRWHLGVMAGYANQHSNTQSNRVGYKSDGRISGYSAGLY 265 

Query: 490 ATWHQLQDKQTGAYADSWMQYQRFRHRINTED-GTERFTSKGITASIEAGYNALLAEHFT 548 

ATW+Q +TGAY DSW Y F + +++++ + + S+G+TAS+E GY 
Sbjct: 266 ATWYQNDANKTGAYVDSWALYNWFDNSVSSDNRSADDYDSRGVTASVEGGYTFEAGTFSG 325 

Query: 549 KKGNSLRVYLQPQAQLTYLGV-NGKFSDSENAHVNLLGSRQLQTRVGVQ AKAQFSL 603 
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+G Y+QPQAQ+T++GV + + + + G +QTR+GV+ + Q 

Sbjct: 326 SEGTLNTWYVQPQAQITWMGVKDSDHTRKDGTRIETEGDGNVQTRLGVKTYLNSHHQRDD 385 

Query: 604 YKNIAIEPFAAVNALYHNKPFGVEMDGE RRVINNKTAI ESQLGVAVKI KSHLTLQAT 660 

K + P+ N + ++K + V+M+G+ R N E + GV K+ ++L+L 
Sbjct: 3 86 GKQRE FQ P Y I E ANW INNS KWAVKMNGQTVGREGARNLG - - E VRTGVE AKV1JNNLS L WGN 443 

Query: 661 FNRQTG - KHHQAKQGALNLQWTF 682 

Q G K 4- QG L +++ + + 
Sbjct: 444 VGVQLGDKGYSDTQGMLGVKYSW 466 



tr Q8XAY3 Putative ATP -binding component of a transport system and 466 
adhesin AA 
protein [z2196] [Escherichia coli 0157 :H7] align 

Score = 129 bits (323), Expect = 2e-28 

Identities = 86/321 (26%), Positives = 156/321 (47%), Gaps = 19/321 (5%) 

Query: 371 NAMIWRFMHNRLKPP I PYCALRLNNKNSD I FDRTLPRKGLWLRVIDGHSNQW VQGKTAP V 43 0 

N+ + +H+RL P +L + S +W+R + GH - 

Sbjct: 156 NSLFSHRLHDRLGEPQYTDSLHSQDSASS MWMRHVGGHERS S AGDGQLNT 205 

Query: 431 EGYRKGVQLGGEVFTWQ-NESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVY 4 89 

+ R +QLGG++ W N +G+M GAQS + +G+G+ AG+Y 

Sbjct: 206 QANRYVLQLGGDLAQWSSNAQDRWHLGVMAGYANQHSNTQSNRVGYKSDGRISGYSAGLY 265 

Query: 490 ATWHQLQDKQTGAYADSWMQYQRFRHRINTED-GTERFTSKGITASIEAGYNALLAEHFT 548 

ATW+Q +TGAY DSW Y F + +++++ + + S+G+TAS+E GY 
Sbjct: 2 66 ATWYQNDANKTGAYVDSWALYNWFDNS VSSDNRSADDYDSRGVTASVEGGYTFEAGTCSG 32 5 

Query: 549 KKGNSLRVYLQPQAQLTYLGV-NGKFSDSENAHVNLLGSRQLQTRVGVQ AKAQFSL 603 

+G Y+QPQAQ+T++GV + + + + G +QTR+GV+ + Q 

Sbjct: 326 SEGTLNTWYVQPQAQITWMGVKDSDHARKDGTRIETEGDGNVQTRLGVKTYLNSHHQRDD 385 

Query: 604 YKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTAI -ESQLGVAVKI KSHLTLQATFN 662 

K +P+ N + ++K + V+M+G+ + + E + GV K+ + +L+L 

Sbjct: 386 GKQRE FQP Y I EANW INNS KVYAVKMNGQT VS RDGAI^NLGE WTGVE AKVNNNLS LWGNVG 445 

Query: 663 RQTG- KHHQAKQGALNLQWTF 6 82 

Q G K + QG L +++ + + 
Sbjct: 446 VQLGDKGYSDTQGMLGVKYSW 466 



tr Q7AE01 Putative ATP-binding component of a transport system and 466 
adhesin 

protein [ECs2116] [Escherichia coli 0157 :H7] align 
Score = 129 bits (323) , Expect = 2e-28 

Identities = 86/321 (26%), Positives = 156/321 (47%), Gaps = 19/321 (5%) 

Query: 3 71 NAMIWRFMHNRLKPPIPYCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPV 43 0 

N+ + +H+RL P +L 4- S +W+R + GH 

Sbjct: 156 NSLFSHRLHDRLGEPQYTDSLHSQDSASS MWMRHVGGHERS SAGDGQLNT 205 
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Query: 431 EGYRKGVQLGGEVFTWQ-NESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVY 489 

+ R +QLGG++ W N -H+ +G+M GAQS + + G + G-f AG+Y 

Sbjct: 206 QANRYVLQLGGDLAQWSSNAQDRWHLGVMAGYANQHSNTQSNRVGYKSDGRISGYSAGLY 265 

Query: 490 ATWHQLQDKQTGAYADSWMQYQRFRHRINTED-GTERFTSKGITASIEAGYNALLAEHFT 548 

ATW+Q +TGAY DSW Y F + + ++++ + + S+G+TAS-fE GY 
Sbjct: 266 ATWYQNDANKTGAYVDSWALYNWFDNSVSSDNRSADDYDSRGVTASVEGGYTFEAGTCSG 325 

Query: 549 KKGNSLRVYLQPQAQLTYLGV-NGKFSDSENAHVNLLGSRQLQTRVGVQ AKAQFSL 603 

+G Y+QPQAQ+T++GV 4- + + + G +QTR+GV+ + Q 

Sbjct: 32 6 SEGTLNTWYVQPQAQITWMGVKDSDHARKDGTRIETEGDGNVQTRLGVKTYLNSHHQRDD 3 85 

Query: 604 YKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTAI-ESQLGVAVKIKSHLTLQATFN 662 

K +P+ N + ++K + V+M+G+ + + E + GV K+ 
Sbjct: 386 GKQRE FQP Y I E ANW I NNS KVYAVKMNGQTVS RDGARNLGE VRTGVE AKVNNNL S LWGNVG 445 

Query: 663 RQTG - KHHQAKQGALNLQWTF 6 82 

Q G K + • QG h + + + + + 
Sbjct: 446 VQLGDKGYSDTQGMLGVKYSW 466 



tr Q83LF4 Hypothetical protein SF1157 [SF1157] [Shigella 773 AA 

flexneri] align 

Score = 128 bits (321) , Expect = 4e-28 

Identities = 139/563 (24%), Positives = 231/563 (40%), Gaps = 85/563 (15%) 

Query: 174 LGPKIKNSHINSEILSVGNYTEWANQVIHH IENYVSFAAHLYSGLDPFHYI 224 

L I +ILS G+ AN++++H ++ VS AA ++ H 

Sbjct: 199 LESSINGQEATVDILS -GSSLRSANEILYHKDETSNVTITDSEVSSAADVFINNIKGHLT 257 

Query: 225 EVTDNSHVIGQT - 1 SLDEFR LENS LWE PRWDSNVGKLKTTNAD IRFN T 271 

NS + G IS D+ . +NS W+ + DS V L N+ + + 

Sbjct: 258 VD ATNS KI TGS ANI S TDDNTHT YLS LSDNS TWD I KADST VSNLT VDNS T VY I S RADGRD V 317 

Query: 272 KSESLLVKEDYAGG ARFRFAYDPKEAKNTALIFEKNVTGTSDIIFENPIDD-LKSLD 327 

+ L + E+Y G R D + ' ++ N +GT+ + N +L+ 

Sbjct: 318 EPTRLTITENYVGNNGVLHLRTELDDDNSATDKWINGNTSGTTRVKVTNAGGSGAYTLN 3 77 

Query: 328 GHQ 1 1 KVNGTAD KHAFRLSGKHQ ' KGIYTLSLQQRPEGFLPKCKN 371 

G +11 V G + + K++G++ KY+Q G + 

Sbjct: 378 GI E I I S VEGESNGEF I KDSRI FAGAYEYSLTRGNTEATNKNWYLTNFQATSGGETNSGGS 437 

Query: 3 72 AMIWRFMHNRLKPPIPYCALRLNNKNSDIFDRTLPRKG LWLRVI 415 

+ L+P L N+ R R G LWLR I 

Sbjct: 438 SAPTVAPTPVLRPEAGSYVANLAAANTLFVMRLNDRAGEMRYIDPVTEQERSSRLWLRQI 4 97 

Query: 416 DGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQ-NESNQLSIGLMGGQAEQRSTFHNPDTD 4 74 

GH+ +R QLGG++ T +S+ +G+M G A + H+ +D 

Sbjct: 4 98 GGHNAWRDSNGQLRTTSHRYVSQLGGDLLTGGFTDSDSWRLGVMAGYARDYNLTHSSVSD 557 

Query: 475 NLTTGNVKGFGAGVYATWHQLQDKQTGAYAD S WMQYQRFRHR INTED - GTERFTS KG I TA 533 

+ G+V+G+ AG-fYATW + GAY DSW QY F++ + + + E + + + KG T 

Sbjct: 558 YRSKGSVRGYSAGLYATWFADDISKKGAYIDSWAQYSWFKNSVKGDELAYESYSAKGATV 617 

Query: 534 SIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENAHVNLLGSR 587 



http://us.expasy.org/cgi-bin/blast.pl 



9/14/04 



ExPASy BLAST2 Interface 



Page 25 of 62 



S+EAGY L + F + QPQAQ ++GV+ NAH GSR 

Sbjct: 618 SLEAG YGFALNKS FGLEAAKYTWI FQPQAQAI WMGVD HNAHTE ANG SRI END AN 671 

Query: 588 -QLQTRVG VQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDG ERRVINN- - 63 7 

+QTR+G ++ + + S EPF +N + +++K F V M+G E+ ++N 

Sbjct: 672 NNI QTRLGFRTF I RTQE KNS GPHGDD FE PF VEMNWI HNS KDFAVSMNGVKVEQDGVSNLG 731 

Query: 63 8 - -KTAIESQLGVAVKIKSHLTLQ 658 

K + L A + + + +Q 
Sbjct: 7 32 E I KLGVNGNLNPAAS VWGNVGVQ 754 



tr Q7UCU4 Hypothetical protein [S1242] [Shigella flexneri] 773 AA 

align 

Score = 128 bits (321) , Expect = 4e-28 

Identities = 139/563 (24%), Positives = 231/563 (40%), Gaps = 85/563 (15%) 

Query: 174 LGPKI KNSHINSE I LS VGNYTEWANQ VTHH 1 ENYVS F AAHL YSGLDP FHY I 224 

L I +ILS G+ AN++++H ++ VS M ++ H 

Sbjct: 199 LESSINGQEATVDILS-GSSLRSANEILYHKDETSNVTITDSEVSSAADVFINNIKGHLT 257 

Query: 225 EVTDNSHVIGQT- ISLDEFR LENS LWE PRWD SNVGKLKTTNAD I RFN T 271 

NS + G IS D+ +NS W+ + DS V L N+ + + 

Sbjct: 258 VDATNSKITGSANISTDDNTHTYLSLSDNSTWDIKADSTVSNLTVDNSTVYISRADGRDV 317 

Query: 272 KSESLLVKEDYAGG AR FR F AYD P KE AKNT AL I F E KNVTGT S DIIFENPIDD-LKS LD 327 

+ L + E+Y G R D + ++ N +GT+ + N + L+ 

Sbjct: 318 EPTRLTITENYVGNNGVLHLRTELDDDNSATDKWINGNTSGTTRVKVTNAGGSGAYTLN 377 

Query: 328 GHQ 1 1 KVNGTAD KHAFRLSGKHQ KGIYTLSLQQRPEGFLPKCKN 371 

G +11 V G + + K + +G ++ K Y + Q G + 

Sbjct: 378 GIEIISVEGESNGEFIKDSRIFAGAYEYSLTRGNTEATNKNWYLTNFQATSGGETNSGGS 437 

Query: 372 AMI WRFMHNRLKPP I PYCALRLNNKNSD I FDRTLPRKG LWLRVI 415 

+ L+P L N+ R R G LWLR I 

Sbjct: 438 SAPTVAPTPVLRPEAGSYVANLAAANTLFVMRLNDRAGETRYIDPVTEQERSSRLWLRQI 497 

Query: 416 DGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQ-NESNQLSIGLMGGQAEQRSTFHNPDTD 474 

GH+ +R QLGG++ T +S+ +G+M G A + H+ +D 

Sbjct: 498 GGHNAWRDSNGQLRTTSHRYVSQLGGDLLTGGFTDSDSWRLGVMAGYARDYNLTHSSVSD 557 

Query: 475 NLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTED-GTERFTSKGITA 533 

+ G+V+G+ AG-fYATW + GAY DSW QY F++ 4- ++ E + ++KG T 

Sbjct: 558 YRSKGSVRGYSAGLYATWFADDISKKGAYIDSWAQYSWFKNSVKGDELAYESYSAKGATV 617 

Query: 534 SIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENAHVNLLGSR 587 

S+EAGY L + F + QPQAQ ++GV+ NAH GSR 

Sbjct: 618 SLEAG YGFALNKS FGLEAAKYTWI FQPQAQAI WMGVD HNAHTEANGSRIENDAN 671 

Query: 588 -QLQTRVG VQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDG ERRVINN - - 637 

+QTR+G ++ + + S EPF +N + + + 4-K F V M+G E+ ++N 

Sbjct: 672 NNIQTRLGFRTFIRTQEKNSGPHGDDFEPFVEMNWIHNSKDFAVSMNGVKVEQDGVSNLG 731 

Query: 63 8 - -KTAIESQLGVAVKIKSHLTLQ 658 

K + L A + +Q 
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tr Q9CKA8 Hypothetical protein PM1717 [PM1717] [Pasteurella multocida] 850 AA 

align 

Score = 126 bits (317) , Expect = le-27 

Identities = 117/434 (26%) , Positives = 199/434 (44%) , Gaps = 47/434 (10%) 

Query: 274 ESLLVKEDYAG GARFRFAYDPKEAKNTALIFEKNVTGTSDIIFEN-PIDDLKS 325 

+++ + EDY G A F + P -h L N TG + + + D " + 

Sbjct: 42 9 KNVTINEDYQGHNGTLHLSADFNGTTNPTDT L F I RGNATGKTRVA I HH I GADAENA 4 84 

Query: 326 LDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQRPE GFLPKCKNAM 373 

+ +G +11+ N + D +AF + KG + L4-+R E G P + M 

Sbjct: 485 VNGVK HE TNTS TD - NAF V I DN YL S KGAF V YHL E KRHE TNQDNW YLT S Y I GGT P S YRAEM 543 

Query: 374 I WRFMHNRLKPPI P YCALRLNNKNS - D I FDRTLPRKGLWLRVIDGHSNQWVQGKTAPVEG 432 

N L LRL ++ S F K W+R ++G + + + + + 

Sbjct: 544 A- -SYANNLYAAHQLFQLRLEDRLSRHHFLNQSADKTFWIRAVEGTNHNRMRDNQNTTKA 601 

Query: 43 3 YRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATW 4 92 

R QLG V N+++ + G+M G A+Q S + + G V+G+ GVY TW 

Sbjct: 602 QRYVTQLGKTVI NQAHYHA - GVMFG YAKQS S KTRS S RVGT - SRGKVQG YALGVYGTW 656 

Query: 493 HQLQDKQTGAYADSWMQYQRFRHR-INTEDGTERFTSKGITASIEAGYNALLAEHFTKKG 551 

+Q + TG Y DSW+QYQ F+++ IN ++ + ++G++AS+E GY+ L + +T 
Sbjct: 657 YQNPNDDTGLYIDSWLQYQWFKNQVINPASSSDNYRTQGLSASLELGYHLPLVQ-YTVAD 715 

Query: 552 NSLRVYLQPQAQLTYLGVNGK-FSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIA- - 608 

+ +QPQAQ + +N K D + ++ +G + • QTR+GV +FSL + 
Sbjct: 716 LKHSLNIQPQAQFIWQKLNSKQHRDPQQTLIHYIGQQNTQTRLGV RFSLDSHFLNT 771 

Query: 609 1 E PFAAVNAL YHNKPFGVEM - DGERRVINNKTAI E S QLGVAVKI KS HLTLQATFNRQ 664 

++P+ VN L+H K +G+ + D + K E 4- G+A + . HL Q 

Sbjct: 772 QWNLKPYFEVNWLHHAKDYGITINDWNHIEGAKQLFEYKAGIASQFGRHLRFWLDTTHQ 831 

Query: 665 TGKHHQAKQGALNL 678 

GK Q K LN+ 
Sbjct: 832 RGK-QQFKDNQLNV 844 



tr P75997 Putative part of putative ATP -binding component of a transport 33 8 AA 
system [bll70] [Escherichia coli K12] 

align 

Score = 118 bits (296) , Expect = 3e-25 

Identities = 76/234 (32%), Positives = 119/234 (50%), Gaps = 19/234 (8%) 

Query: 410 LWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQ -NESNQLS IGLMGGQAEQRSTF 4 68 

LWLR I GH+ +R QLGG++ T +S+ +G+M G A + 

Sbjct : 66 LWLRQIGGHNAWRDSNGQLRTTSHRYVSQLGGDLLTGGFTDSDSWRLGVMAGYARDYNLT 125 

Query: 469 HNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTED-GTERFT 527 
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H+ +D + G+V+G+ AG+YATW + GAY DSW QY F++ + ++ E ++ 

Sbjct: 126 HSSVSDYRSKGSVRGYSAGLYATWFADDISKKGAYIDSWAQYSWFKNSVKGDELAYESYS 185 

Query: 52 8 S KG I TAS I E AG YNALLAEH FTKKGNS LRVYLQPQAQLT YLGVNGKF S D S ENAHVNLLGS R 5 87 

+KG T S+EAGY h + F + QPQAQ ++GV+ NAH GSR 

Sbjct: 186 AKGATVSLEAGYGFALNKSFGLEAAKYTWIFQPQAQAIWMGVD HNAHTE ANGS R 239 

Query: 588 QLQTRVG VQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDG 630 

+QTR+G ++ + + S EPF +N + + 4-4-K F V M+G 

Sbjct: 240 IENDANNNIQTRLGFRTFIRTQEKNSGPHGDDFEPFVEMNWIHNSKDFAVSMNG 2 93 



sp Q03155 Adhesin aidA-I precursor [aidA-I] [Escherichia 1286 AA 

AIDA_ECOLI coli] align 

Score = 118 bits (295), Expect = 4e-25 

Identities = 105/414 (25%), Positives = 186/414 (44%), Gaps = 40/414 (9%) 

Query: 3 02 L I FEKNVTGTSD I I FENP IDDL - KSLDGHQ I I KVNGTADKHAFRLSGKHQKGI YTLSLQQ 36 0 

L+ + N +G SDI + + N + + DG II V G +D F L + G Y +LQ+ 

Sbjct: 880 LWKGNTSGQSDIVYVNEDGSGGQTRDGINI ISVEGNSDAE - FSLKNRWAGAYDYTLQK 938 

Query: 3 61 RPEGFLPKCKNAMIWRFMH NRLKP P I P YCALRLNNKNS DIF 401 

E N + H + +P • A + NS + 

Sbjct : 93 9 GNES GTDNKGWYLTSHLPTSDTRQYRPENGS YATNMALANSLFLMDLNERKQFRAMS 995 

Query: 402 DRTLPRKG - LWLRVIDGHSNQWV QGKTAPVEGYRKGVQLGGEVFTWQNES -NQLSIG 456 

DTP +W+++ G S+ + Q KT + QLGG+++ + E ++G 

Sbjct : 996 DNTQPESASVWMKITGGISSGKLNDGQNKTTTNQFIN QLGGDIYKFHAEQLGDFTLG 1052 

Query : 457 LMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHR 516 

+MGG A + N ++ + G+ GVY TW-hQ + TG +A++WMQY F 

Sbjct: 1053 IMGGYANAKGKTINYTSNKAARNTLDGYSVGVYGTWYQNGENATGLFAETWMQYNWFNAS 1112 

Query : 517 INTED - GTERFTSKG ITAS IEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSD 575 

+ + E++ G+TAS GYN + + +G + +LQP Q ++GV 

Sbjct: 1113 VKGDGLEEEKYNLNGLTASAGGGYNLNVHTWTSPEGITGEFWLQPHLQAVWMGVTPDTHQ 1172 

Query: 576 SENAH-VNLLGSRQLQTRVGVQA- -KAQFSLYKNIA- - IEPFAAVNALYHNKPFGVEMDG 63 0 

+N V G +QT+ G++A K + +L K+ P+ N +++ FGV+M 

Sbjct: 1173 EDNGT WQGAGKNNI QTKAG I RAS WKVKS TLDKDTGRRFRP Y I E ANW I HNTHE FGVKMSD 1232 

Query : 631 ERRVIN-NKTAIESQLGVAVKIKSHLTLQATFNRQTGKH-HQAKQGALNLQWTF 682 

+ ++ E 4- G+ I +L++ Q G H A GAL + + -4-+F 

Sbjct: 1233 DSQLLSGSRNQGEIKTGIEGVITQNLSVNGGVAYQAGGHGSNAISGALGIKYSF 1286 



tr Q7CPG9 Putative autotransported protein [misL] [Salmonella 955 
typhimurium] AA 



align 



Score = 110 bits (276) , Expect = 7e-23 

Identities = 81/286 (28%), Positives = 142/286 (49%), Gaps = 22/286 (7%) 
Query: 410 LWLRVIDGHS - -NQWVQGKTAPVEGYRKGVQLGGEVFTWQNES -NQLS IGLMGGQAEQRS 466 
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LW+R + H+ N + Y +QLGG++ W + + + IG M G A + + 

Sbjct: 679 LWMRNVGAHTRFNDGSGQLKTRINSYV- -LQLGGDLAQWSTDGLDRWHIGAMAGYANSQN 736 

Query: 467 TFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTED-GTER 525 

+ +D + G V G+ G+Y TW+ ++GAY D+WM Y F +++ +D E+ 

Sbjct: 73 7 RTLSSVSDYHSRGQVTGYSVGLYGTWYANNIDRSGAYVDTWMLYNWFDNKVMGQDQAAEK 796 

Query: 526 FTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENAHVNLL- 584 

+ S KG I TAS + E AG Y + L E + +LQP+AQ+ ++GV D+ A+ L+ 

Sbjct: 797 YKSKGITASVEAGYSFRLGESAHQS YWLQPKAQVWMGVQA- -DDNREANGTLVK 849 

Query: 585 - - GS RQLQTRVGVQAKAQ FSLYKNIAIEPFAAVNALYHNKPFGVEMDG-ERRVINN 637 

+ L TR+GV+A K+ + PF N + + + +P V+MD + 

Sbjct: 85 0 DDTAGNLLTRMGVKAYINGHNAIDNDKSREFQPFVEANWIHNTQPASVKMDDVSSDMRGT 909 

Query: 638 KTAIESQLGVAVKIKSHLTLQATFNRQTG - KHHQAKQGALNLQWTF 682 

K E ++G+ ++ L + +Q G + QG L ++++F 

Sbjct: 910 KNIGELKVGIEGQVTPRLNWGNVAQQVGDTGYSDTQGMLGMKYSF 955 



tr Q9Z625 MisL [misL] [Salmonella typhimurium] 955 AA 

align 

Score = 110 bits (276) , Expect = 7e-23 

Identities = 81/286 (28%), Positives = 142/286 (49%), Gaps = 22/286 (7%) 

Query: 410 LWLRVIDGHS- -NQWVQGKTAPVEGYRKGVQLGGEVFTWQNES -NQLSIGLMGGQAEQRS 466 

LW+R + H+ N + Y +QLGG++ W + ++ IG M G A + + 

Sbjct: 679 LWMRNVGAHTRFNDGSGQLKTRINSYV- -LQLGGDLAQWSTDGLDRWHIGAMAGYANSQN 736 

Query: 467 TFHNPDTDMLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTED-GTER 52 5 

4- +D + G V G+ G+Y TW+ ++GAY D+WM Y F + + + +'D E+ 

Sbjct: 737 RTLS S VS D YHS RGQ VTG YS VGL YGTW YANN I DRS GAYVDTWML YNWFDNKVMGQDQAAEK 796 

Query: 52 6 FTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENAHVNLL- 584 

+ SKGITAS+EAGY+ L E + +LQP+AQ+ ++GV D+ A+ L+ 

Sb j Ct : 7 97 YKSKGITASVEAGYSFRLGESAHQS YWLQPKAQVWMGVQA- -DDNREANGTLVK 84 9 

Query: 585 - -GS RQLQTRVGVQAKAQ FSLYKNIAIEPFAAVNALYHNKPFGVEMDG-ERRVINN 637 

+ L TR+GV+A K+ +PF N +++ +P V+MD + 

Sbjct: 850 DD TAGNL LTRMG VKAY I NGHNA I DND KS RE F Q P F VE ANW I HNTQ PAS VKMDD VS S DMRGT 90 9 

Query: 63 8 KTAIESQLGVAVKIKSHLTLQATFNRQTG -KHHQAKQGALNLQWTF 682 

K E ++G+ ++ L + +Q G + QG L ++++F 

Sbjct: 910 KNIGELKVGIEGQVTPRLNVWGNVAQQVGDTGYSDTQGMLGMKYSF 955 



S P P4550 8 Hypothetical protein yfaL precursor [yfaL] [Escherichia 1250 

YFAL ECOLI coli] AA 



align 



Score = 96.3 bits (238), Expect = 2e-18 

Identities = 74/279 (26%), Positives = 136/279 (48%), Gaps = 27/279 (9%) 



Query: 412 LRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFT- -WQNESNQLSIGLMGGQAEQRSTFH 469 
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LRVI G + G+ A E VQL G++F+ W + + +G++GG ++ + 

Sbjct : 989 LRVIGGDYHYTAAGQLAQHED-TSTVQLSGDLFSGRWGTDGEWM-LGIVGGYSDNQGDSR 1046 

Query : 470 NPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRIN-TEDGTERFTS 528 

+ T G+ G+ ++W Q +++ GA+ DSW+QY F + ++ EDGT+ + S 

Sbjct: 1047 SNMTGTRADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHS 1106 

Query: 52 9 KG I TAS I E AG YNALL AEHFTKKGNS LRVYLQPQAQLT YLGV - NGKFSDS ENAHVNLLGS R 587 

GI AS+EAGY L V ++PQAQ+ Y GV F+ + A V+ 

Sbjct: 1107 SGI IASLEAGYQWLPGR GWIE PQAQVI YQG VQQDDFTAANRARVS QS QGD 1157 

Query : 58 8 QLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINN KTAIESQ 644 

+QTR+G+ ++ + A+ ++ Y+ + P E + + + I++ K E + 

Sbjct: 1158 DIQTRLGLHSEWR TAVHVI PTLDLNYYHDPHS TE I EEDGS T I S DD AVKQRGE I K 1211 

Query: 64 5 LGVAVKI KSHLTLQATFNRQTGKHHQAKQG ALNLQW 680 

■+GV I + Q G A+ ++ ++W 

Sbjct: 1212 VGVTGMISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW 1250 



tr Q8ZN57 Similar to the C-terminal region of AIDA [shdA] [Salmonella 2039 AA 
typhimurium] 

align 

Score = 95.5 bits (236), Expect = 3e-18 

Identities = 75/289 (25%), Positives = 128/289 (43%), Gaps = 29/289 (10%) 

Query: 410 LWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFH 469 

+W R . G + + + QLGG++ W N -+--f+G+M + + + 

Sbjct: 1764 WAR F KAGKAE S E AVS GN I DMD SN YS Q FQ LGGD I L AWGNGQQ S VT VG VM ASYI 1816 

Query: 470 NPDTDNL TTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRIN 518 

N DTD+ ++GNV G-f GVYATW +GAY DSW QY + + + 

Sbjct: 1817 NADTDSTGNRGADGSQFTSSGNVDGYNLGVYATWFADAQTHSGAYVDSWYQYGFYNNSVE 1876 

Query : 519 TED - GTERFTSKG I TAS IEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNG-KFSDS 57 6 

+ D G+E + S S+E GY +A GN+ V L PQAQ+ + + D+ 

Sbjct: 1877 S GDAGS ES YDS TANAVS LETG YR YD I A LSNGNT - - VS LTPQAQWWQNYS AD S VKDN 1931 

Query: 577 ENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVIN 63 6 

++ TR+G++ + I+PFA N L+ + V D + V 

Sbjct: 1932 YGTRIDGQDGDSWTTRLGLRVDGKLYKGSRTVIQPFAEANWLHTSDDVSVSFD-DATVKQ 1990 

Query: 63 7 MKTA- - IESQLGVAVKIKSHLTLQATFNRQTGKHHQAK-QGALNLQWTF 682 

+ A E ++G+ I +++A QTG + G+LNL++ + 

Sbjct: 1991 DLPANRAELKVGLQADIDKQWSVRAQVAGQTGSNDFGDLNGSLNLRYNW 2039 



tr Q8CW7 Hypothetical protein yfaL [yfaL] [Escherichia coli 1254 AA 

06] align 

Score = 95.5 bits (236), Expect = 3e-18 

Identities = 71/278 (25%) , Positives = 138/278 (49%) , Gaps = 25/278 (8%) 
Query: 412 LRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQ -NESMQLS IGLMGGQAEQRSTFHN 470 
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LRVI G + G+ A E VQL G +F+ + + +G++GG ++ + + 

Sbjct : 993 LRVIGGRYHYTAVGQLAQHED - TSTVQLSGNLFSGHWGDDGEWMLGIVGGYSDNQGDSRS 1051 

Query : 4 71 PDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRIN - TEDGTERFTSK 52 9 

T G+ G+ ++W+Q ++ + GA+ DSW+QY F + + + +DG + + S 

Sbjct: 1052 NMTGTRADNQNHGYAVGLTSSWYQHGNQKQGAWLDSWLQYAWFNNDVSEQDDGVDHYHSS 1111 

Query : 53 0 GITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGV-NGKFSDSENAHVNLLGSRQ 588 

GI AS+EAGY L V ++PQAQ+ Y GV F+ + +A V+ 

Sbjct: 1112 GI IASLEAGYQWLPGR GWIEPQAQVIYQGVQQDDFTAANHARVSQSQGDD 1162 

Query: 589 LQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTAI ESQL 645 

+QTR+G+ ++ + + +++ P 4-N Y+ + P E+ + + I + + A E + + 

Sbjct: 1163 IQTRLGLHSEWRTA VS VTPTLDLN - - YYHD PHATE IEEDGSTI SDDAAKQRGE I KV 1216 

Query: 646 GVAVKIKSHLTLQATFNRQTGKHHQAKQG ALNLQW 680 

G+ I ++L+ + Q G A+ ++W 
Sbjct: 1217 GITGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW 1254 



tr Q9XCJ4 ShdA [shdA] [Salmonella typhimurium] 2035 AA 

align 

Score = 95.5 bits (236), Expect = 3e-18 

Identities = 75/289 (25%) , Positives = 128/289 (43%) , Gaps = 29/289 (10%) 

Query: 410 LWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFH 469 

+W R G + + + QLGG++ W N +++G+M +++ 
Sbjct: 1760 VW AR F KAG KAE S E AVS GN I DMD S N Y S Q F Q LG GD I L AWGNGQ Q S VT VG VM ASYI 1812 

Query: 470 NPDTDNL TTG3STVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRIN 518 

N DTD+ ++GNV G+ GVYATW 4- GAY DSW QY + + + 

Sbjct: 1813 NADTDSTGNRGADGSQFTSSGNWGYNLGVYATWFADAQTHSGAYVDSWYQYGFYNNSVE 1872 

Query : 519 TED - GTERFTSKGITAS IEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNG - KFSDS 576 

+ D G+E + S S+E GY +A GN+ V L PQAQ+ + + D+ 

Sbjct: 1873 SGDAGSESYDSTANAVSLETGYRYDIA LSNGNT- -VSLTPQAQVWQNYSADSVKDN 1927 

Query: 577 ENAHWLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVIN 636 

++ TR+G++ + I+PFA N L+ + V D + V 

Sbjct: 192 8 YGTRIDGQDGDSWTTRLGLRVDGKLYKGSRTVIQPFAEANWLHTSDDVSVSFD-DATVKQ 1986 

Query: 63 7 NKTA- -IESQLGVAVKIKSHLTLQATFNRQTGKHHQAK-QGALNLQWTF 682 

+ A E ++G+ I +++A QTG + G+LNL++ + 

Sbjct: 1987 DLPANRAELKVGLQADIDKQWSVRAQVAGQTGSNDFGDLNGSLNLRYNW 2035 



tr Q8XE2 8 Putative ATP -binding component of a transport system [yfaL] 1250 AA 
[Escherichia coli 0157 :H7] 

align 

Score = 93.2 bits (230), Expect = le-17 

Identities = 72/279 (25%) , Positives = 134/279 (47%) , Gaps = 27/279 (9%) 
Query: 412 LRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFT- -WQNESNQLS IGLMGGQAEQRSTFH 469 
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LRVI G + G+ A E VQL G+ + F+ W + + +G++GG ++ + 

Sbjct: 989 LRVIGGDYHYTAAGQLAQHED-TSTVQLSGDLFSGRWGTDGEWM-LGIVGGYSDNQGDSR 1046 

Query : 470 NPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINT-EDGTERFTS 528 

+ T G+ G+ ++W Q ++ GA+ D+W+QY F + + + EDG + + S 

Sbjct: 1047 SSMTGTRADNQNHGYAVGLTSSWFQHGKQKQGAWLDNWLQYAWFSNDVSEHEDGVDHYHS 1106 

Query: 529 KGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGV-NGKFSDSENAHVNLLGSR 587 

GI AS+EAGY L V ++PQAQ+ Y GV F+ + A V+ 

Sbjct: 1107 SGI IASLEAGYQWLPGR G WI E PQAQVI YQGVQQDD FTAANRARVS QS QGD 1157 

Query : 588 QLQTRVGVQAKAQFSLYKNIAIEPFAAWALYHNKPFGVEMDGERRVINN KTAIESQ 644 

+QTR+G+ + + + A+ + + Y++ P E + + + I++ K E + 

Sbjct: 1158 DIQTRLGLHSEWR T AVH V I P TLDLN Y YHD PH S TE I E ED AS T I S DD AVKQRGE I K 1211 

Query: 645 LGVAVKI KSHLTLQATFNRQTGKHHQAKQG ALNLQW 680 

+GV I ++L+ + Q G A+ ++ ++W 

Sbjct: 1212 VGVTGN I S QR VS LRGS VAWQ KGS DD F AQT AG F L S MTVKW 12 5 0 



tr Q7AC44 Putative ATP -binding component of a transport system 12 5 0 

[ECS3116] AA 
[Escherichia coli 0157 :H7] align 

Score = 93.2 bits (230), Expect = le-17 

Identities = 72/279 (25%), Positives = 134/279 (47%), Gaps = 27/279 (9%) 

Query: 412 LRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFT- - WQNESNQLS IGLMGGQAEQRSTFH 469 

LRVI G + G+ A E VQL G++F+ W + + +G+ +GG + 

Sbjct: 989 LRVIGGDYHYTAAGQLAQHED-TSTVQLSGDLFSGRWGTDGEWM-LGIVGGYSDNQGDSR 1046 

Query: 470 NPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINT - EDGTERFTS 528 

+ T Q+ G+ ++W Q ++ GA+ D+W+QY F + +■+ EDG + + S 

Sbjct: 1047 SSMTGTRADNQNHGYAVGLTSSWFQHGKQKQGAWLDNWLQYAWFSNDVSEHEDGVDHYHS 1106 

Query : 529 KGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGV-NGKFSDSENAHVNLLGSR 587 

GI AS+EAGY L V ++PQAQ+ Y GV F+ + A V+ " 

Sbjct: 1107 SGI IASLEAGYQWLPGR GWIEPQAQVIYQGVQQDDFTAANRARVSQSQGD 1157 

Query: 588 QLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINN KTAIESQ 644 

+QTR+G+ ++ '+ A+ ++ Y++ P E++ + I + + K E + 

Sbjct: 1158 DIQTRLGLHSEWR TAVHVI PTLDLN YYHD PHS TE I EED AST I SDD AVKQRGE I K 1211 

Query: 64 5 LGVAVKI KSHLTLQATFNRQTGKHHQAKQG- - -ALNLQW 680 

+GV I ++L+ + Q G A+ ++ ++W 

Sbjct: 1212 VGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW 1250 



tr Q9FCW0 Hypothetical protein [Kluyvera ascorbata] 652 AA 

align 

Score = 73.9 bits (180), Expect = 9e-12 

Identities = 76/299 (25%), Positives = 123/299 (40%), Gaps = 44/299 (14%) 
Query: 246 NSLWEPRWDSNVGKLKTTNADIRFNTKSES - -LLVKEDYAGG ARFRFAYDPKEAKNT 300 
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NSLW+ DS+LN++S LV+YG R + 

Sbjct: 3 61 NSLWKMTGDSVLSSLTLNNGTVEWAGASAGNVLTVAGNYQGNNGLLRINTVLGDDNSVTD 420 

Query: 301 ALIFEKNVTGTSDIIFENPIDDL-KSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQ 359 

L+ E + +GT+ + N K+++G ++I V+G +D + F SG+ G Y +L+ 

Sbjct: 421 KLWEGDTSGTTFVAVTNAGGSGDKTINGIEVIHVDGASDGN-FVQSGRIVAGSYEYALR 479 

Query: 360 QRPEGFLPKCKNAMIWRFMHNRLKP- -PIP YCALRLNNKNSD I FDRTLP 406 

R EG W M+ P P+P Y A L N+ R 

Sbjct: 480 -RGEGTAFNH -WYLMNAATTPEEPVPEKPTLRPESGSYIA-NLAAANTMFTTRLHD 532 

Query: 407 RKG LWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQ-NE 44 9 

R G +W+R + GH+ + R +QLGG++ W N 

Sbjct: 533 RLGETQYIARLTGEQKVTSMWMRNVGGHTRFKDSSGQLSTQSNRYVLQLGGDIAQWSTNG 592 

Query: 450 SNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWM 50 8 

++ +GLM G +S +• + + G V G+ AG+Y TW+ + +TG+Y DSW+ 

Sbjct: 593 LDRWHLGLMAGYGNSQSNTKSDVSRYHSRGQVTGYSAGIYGTWYANEADKTGSYLDSWI 651 



tr Q6IU25 YdeU (Fragment) [Escherichia coli B] 169 AA 

align 

Score = 70.5 bits (171), Expect = le-10 

Identities = 39/138 (28%) , Positives = 63/138 (45%) , Gaps = 11/138 (7%) 

Query: 3 71 NAMIWRFMHNRLKPPIPYCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPV 43 0 

N+ + +H+RL P + +D +W+R + GH 

Sbjct: 41 NSLFSHRLHDRLGEP Q YTD S LHS QGS AS SMWMRHVGGHERS RAGDGQLNT 90 

Query: 431 EGYRKGVQLGGEVFTWQ-NESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVY 489 

+ R +QLGG++ W N ++ +G+M GAQS + + G + G+ AG+Y 

Sbjct: 91 QANRYVLQLGGDLAQWSSNAQDRWHLGVMAGYANQHSNTQSNRVGYKSDGRISGYSAGLY 150 



Query: 4 90 ATWHQLQDKQTGAYADSW 5 07 

ATW+Q +TGAY DSW 
Sbjct: 151 AT W YQND ANKTGAYVD S W 168 



tr Q88LP9 Outer membrane autotransporter [PP1880] [Pseudomonas putida 73 0 

(strain AA 
KT2440)] align 

Score = 66.2 bits (160), Expect = 2e-09 

Identities = 109/474 (22%), Positives = 183/474 (37%), Gaps = 84/474 (17%) 

Query: 228 DNSHVIGQTIS LDEF RLENSL WEPRWDSNVGKLKTTN 264 

+NSH++G ++ L+ F RL+N W +S V L N 

Sbjct: 2 57 mSHLVGDIVAASGGTAJSTVLLENFATLKGRLDNVASLEINSGGEWTLVDNSQVTDLSLDN 316 

Query: 265 ADIRFNTKSESLLVK-EDYAGGARFRFAYDPKEAKNTALIFEKNVTGTSD 1 1 FEN 318 

+RF E + E+ G F D +++ L +VTGT+ +1 + 

Sbjct: 317 GAVRFGGPGEFFTLSVENLTGNGTFIMEADFSTSQSDFL D VTGTAS GNHQLL I S AS 372 

Query: 319 PIDDLKSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQRPEG FLPK 3 68 
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DL H + GD LG GY+LQR+ p 

Sbjct: 373 GNDPLTDNSLHWHTAAG- -DSQFSLLGGSVDLGAYSYDLVQRGDNDWYLDATTRTVSPG 43 0 

Query: 3 69 CKNAM I WRFMHNRLKP P I P YCALRLNNKNS D I FDRTLPRKGLWLRVI DGHSNQWVQGKT - 42 7 

+ M +PIYL+ R+G W+R + NQ+ T 
Sbjct: 431 TQTVMALA NWPTIWYGELGVLRSRMGDVRRNPGKAGGWVR SYGNQFNVSATS 483 

Query: 428 -APVEGYRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGN VKG 483 

A + ++G+ +G + + N L +G+ G + N D NL G+ V 

Sbjct: 484 GAAYQQQQQGLSIGADAPLAAGDGNWL - VGITAG YSNSDL -NLARGSSASVDS 534 

Query: 484 FGAGVYATWHQLQDKQTGAYADSWMQYQRFRHR- -INTEDGTER FTS KG I TAS I EAG 538 

+ AG YATW D ++G Y D+ + RFR-f + + DG+ + ++-4- G S+E G 

Sbjct: 535 YHAGAYATW LD PE S GYY I DT VAR INRFRNQAD VRLSDGS KAKGD YSNLGAG VSLE VG 591 

Query: 53 9 YNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGK-FSDSENAHVNLLGSRQLQTRVGVQA 597 

+ LA+ 4- +L+P AQL+ L V GK +S N + L +VG 

Sbjct: 592 RHLNLADDW FLEPFAQLSGLWQGKDYSLDNGMRANSNSTHSLLGKVGTSV 642 

Query: 5 98 KAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTAIESQLGVAVKI 651 

FS + + + P+ V A++ + R +N 4- ++G V + 

Sbjct: 643 GRTFS AGTGRS VQP YLRVAAVHE F VNDNQVKVNDNRF S SNLAGS RAE I GAG VAV 696 



tr Q884S7 Autotransporter, putative [PSPTO2011] [Pseudomonas syringae 769 
(P v - ~ AA 

tomato) ] ali gn 

Score = 66.2 bits (160), Expect = 2e-09 

Identities = 105/460 (22%), Positives = 184/460 (39%), Gaps = 58/460 (12%) 

Query: 24 9 WEPRWDSNVGKLKTTNADIRFNTKSESLLVKEDYAGGARFRFAYDPKEAKNTALIFEKMV 3 08 

W+ + D+VL +F + L + +G F DA + 

Sbjct: 342 WQMQGDNAVTSLSMQGGSVGFGGEGFHTLSLNELSGSGTFGLRVDLDNAVGDLINVNGQA 401 

Query: 309 TGTSDIIFENPIDDLKSLDGHQIIKVNGT- -ADKHAFRLSGKHQKGIYTLSLQQRPEGFL 366 

+ N S D Q +KV T D L G+ G Y+ L+Q+ + 

Sbjct: 402 SGQFGLRVRNTGVEVISAD-MQPLKVVHTEGGDAQFSLLGGRVDLGAYSYLLEQQGNDWF 460 

Query: 367 PKCKNAMIWRFMHNRLK PPIPYCALR-LNNKNSDIFDRTLPRKGLWLRVIDGHSN 420 

+1 + L PI L L ++ ++■ R + G W+R N 

Sbjct: 461 WGRDKVISPSTQSALALYSAAPAIWMSELSTLRSRMGEV- -RASGQAGGWMRAYGSRLN 518 

Query: 421 QWVQGKTAPVEGYRK GVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLT 477 

T+ YR+ G+ LG 4- -i- + +L +G++GG + + D T 

Sbjct: 519 ATTSDGVDYRQKQSGLSLGADAPV-EVSNGRLLVGVLGGYSTS -DLDVSRGT 568 

Query: 47 8 TGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHR- - INTEDGTER FTS KG IT 532 

TG V + AG Y TW L D G Y D ++ RFR++ + D ++ +++ G+ 

Sbjct: 569 TGKVASYYAGAYGTW- -LSD- -DGYYVDGVLKLNRFRNKADVAMSDASKAKGDYSNTGVG 624 

Query: 53 3 ASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENAHVNLLGSRQLQTR 592 

+EAG + LA+ + +L+P AQL+ + V G+ +N + ++ +TR 

Sbjct: 625 GWVEAGRHI KLADDY FLE P FAQL S S VWQGQELRLDNG MKAKNARTR 671 

Query: 593 VGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTAIESQL 645 
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V K SL + +A+ + V Y E V N ++ L 

Sbjct : 672 -SVLGKVGTSLGRTVALKD-GGVLQPYVRVAVAQEFSRRNEVKANDVKFDNSLFGSRGEL 72 9 

Query: 646 - -GVAVKIKSHLTLQATFNRQTGKHHQAKQGA-LNLQWTF 682 

GV+V + L L A + G+H + GA + L+ TF 
Sbjct: 730 GAGVSVSLSERLKLHADVDYMKGQHIEQPWGANVGLRLTF 769 



tr Q7VYJ2 Autotransporter [BP1344] [Bordetella pertussis] 866 AA 

align 

Score = 65.5 bits (158), Expect = 3e-09 

Identities = 79/308 (25%) , Positives = 131/308 (41%) , Gaps = 46/308 (14%) 

Query: 3 97 NSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAP-VEGYRKGVQLGGEVFTW-QNESNQLS 454 

N + R R W R G+S Q + G P ++ GVQLG ++++ + + Q 
Sbjct: 5 83 NQALLARDGER VAAWARAYGGNS KQALDGDAQPG I DARLAGVQLGQDL YS S VRPDGGQHR 642 

Query: 455 IGLMGGQAEQRSTFHNP- -DTDNLTTGN- -VKGFGAGVYATWHQLQDKQTGAY-ADSWMQ 509 

GL GG + R H + TG + G+ G Y T+ 4- A A++WM 

Sbjct: 64 3 FGLFGGYGQARGDTHGSAGGERDAATGRLTIDGYSVGGYWTYVGPRGWYVDAVLANTWMD 7 02 

Query: 510 YQRFRHRINTEDGTERFT - SKGITAS I EAGYNALLAEHFTKKGNSLRVYLQPQAQLTY - - 566 

+++ G + T .+ TAS+E+GY L+E +T L+PQAQL Y 

Sbjct: 703 IDT DS KAGRD ADTRGQAFTAS LE S G YPLALS ERWT LEPQAQLIYQH 748 

Query: 567 LGVNGKFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAWALYHNKPFGV 626 

V+G FSD+ + V + L R+G + + +++ + P+AA+M F 
Sbjct: 749 TRVDG - FSDAV- SEVRIRDDNALTARLGARLQGEYAAAAQV- WRPYAALN FWR 798 

Query: 627 EMDGERRVI NNKTAI E S QLGVAVKI KSHLTLQATFNRQTGKHHQ AKQGA - - 675 

GE V+ T+ + E G +V + L L T Q +GA 

Sbjct: 799 TFSGENTWLGEDS IDTRRGATSLELAAGASVTLARSLALYGRLAYATS IDSQYLRGASA 858 

Query: 676 -LNLQWTF 682 
Sbjct: 859 QLGMRYTW 866 



tr Q7WIM1 Putative autotransporter [BB2830] [Bordetella bronchiseptica 849 AA 
(Alcaligenes bronchisepticus) ] 

align 

Score = 64.3 bits (155), Expect = 7e-09 

Identities = 75/304 (24%), Positives = 125/304 (40%), Gaps =38/304 (12%) 

Query: 397 NSD I FDRTL PRKGLWLR VI DGHSNQ WQGKTAP - VEGYRKGVQLGGE VFTW - QNE SNQL S 454 

N + R R W R G+S Q + G P ++ GVQLG ++++ + + Q 
Sbjct: 566 NQALLARDGERVAAWARAYGGNS KQALDGDAQPG IDARLAGVQLGQDLYSS VRPDGGQHR 625 

Query: 4 55 IGLMGGQAEQRSTFHNP- -DTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQR 512 

GL GG R H +TG+ GV W + + GYD+ + 

Sbjct: 626 FGLFGGYGHARGDTHGSAGGERDAATGRLTIDGYSVGGYWTYVGPR- -GWYVDTVLANTW 683 

Query: 513 FRHRINTEDGTERFT- SKGITAS IEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVN- 570 
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+++ G + T + +TAS+E+GY L+E +T L+PQAQL Y 

Sbjct: 684 MD I DTDS KAGRDADTRGQAL TAS LE S G Y PL AL S ERWT -LEPQAQLIYQHTRV 734 

Query: 571 GKFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDG 63 0 

FSD+ + V 4- L R+G + + + + + + P+AA+N F G 

Sbjct: 735 DDFSDAV - SEVRIRDDNALTARLGARLQGE YAAAAQV - WRP YAALN FWRTFSG 785 

Query: 631 ERR VI NNKTAIESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGA LNL 678 

E V+ ' T++E G +V + L L T Q +GA L + 

Sbjct: 786 ENTWLGEDSIDTRRGATSLELAAGASVTLARSLALYGRLAYATS IDSQYLRGASAQLGM 845 

Query: 679 QWTF 682 
Sbjct: 84 6 RYTW 84 9 



tr Q883X2 Autotransporter, putative [PSPT02225] [Pseudomonas syringae 773 
(PV. AA 
tomato) ] align 

Score = 63.9 bits (154), Expect = 9e-09 

Identities = 69/283 (.24%), Positives = 116/283 (40%) r Gaps = 39/283 (13%) 

Query: 4 07 RKGLWLRVIDGHSNQWVQGKTAPVEGYRK GVQLGGEVFTWQMESNQLS IGLMGGQAE 463 

+ G W+R N A GY+ + G LG + + L+ G+M GQ+ 

Sbjct: 508 QSGAWMRTYGNKFNV SDAS GFGYQQTQQGFS LGADGKVPMGDGQWLA - GVMAGQS S 562 

Query: 4 64 QRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRH- -RINTED 521 

+ D +G V + G Y+TW D TG Y D + + RF + R+N D 
Sbjct: 563 SDLSL DRGASGKVDS YYVGAYSTW LDSDTGYYFDGVLKFNRFNNKARVNLSD 614 

Query: 522 GTER FTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSEN 578 

GT +++ G+ AS+E G + L + +++P +QL + V GK + +N 

Sbjct: 615 GTRTKGD YSNS GVGAS LE FGRH I KLDNG Y FVEPYSQLAGWVEGKDYELDN 665 

Query: 579 AHWLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVI 635 

A +L +R L ++G F L + ++P+ AH E+ V 

Sbjct: 666 GMRAENDL - - TRSLVGKLGATTGRNFDLGQGRTVQPYVR - TAWVHEFAKNNEVQVNDNVF 722 

Query: 636 NNK TAI E S QLG VAVKI KSHLTLQAT FMRQTGKHHQAKQGA 675 

NN + E +G+A + + A F G + GA 

Sbjct: 723 NNDLSGSRGELGIGIAASLSERFQVHADFEHSNGDKVEQPWGA 765 



tr Q7WIB0 Putative autotransporter [BB2941] [Bordetella bronchiseptica 937 AA 
(Alcaligenes bronchisepticus) ] 

align 

Score = 55.5 bits (132), Expect = 3e-06 

Identities = 106/493 (21%), Positives = 170/493 (33%), Gaps = 76/493 (15%) 

Query: 224 IEVTDNSHVIGQTISLDEFRL-ENSLWEPRWDSNVGKLKTT-NADIRFNTKSE SLLV 278 

+4-V D + G T SLD L W S+VG+L A + F + L 

Sbjct: 477 LDVADGAQWHGATQSLDRLALGAGGQWRMSAASSVGELSMEPGAAWFGDAAGPGFQTLT 536 
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Query: 279 KED YAGGARFRFAYDPKEAKNTAL I FEKNVTGTSD I I FENP IDDLKSLDGHQI I KVNGTA 338 

AG F D L+ G+P SQ-f-VA 

Sbjct: 537 VRTLAGAGSFEMRADAALEHADQLWTDQAEGRHRVWLRAPAGAEPSKA- - QAVLVRAPA 594 

Query: 339 D - KHAFRLSGKHQK GIYTLSLQQRPEGFLPKCKNAM IWR 376 

DK+FLG + GY L Q+P G + +W 

Sbjct: 595 DGKASFELDGSDGRADFGTYRYGLAQQPGGAWGLVRTGYSSTAAAALDTGGLGAVQGLWY 654 

Query: 377 FMHNRLKPPIPYCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKG 436 

ML 4- LRLN + R ++ R+ + QG + G 
Sbjct: 655 AESNALGKRMG- -ELRLNPDAGGAWGRAFSQRQ RISPRAGRHFQQGVS G 701 

Query: 437 VQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQ 496 

++LG + W +G+G F +++VG++A 
Sbjct: 702 IELGADR-AWPVAGGRWHAGGLLGYTRASRGFSGQGKGHTDSVHVGGYATYIGAN 755 

Query: 497 DKQTGAYADSWMQYQRFRHRINTEDGTER FT S KG I TAS I EAG YNAL LAEH FTKKG 551 

G YAD+ ++ RF + + R + + G+ ++EAG L H+ 
Sbjct: 756 GVYADATLRASRFENSFDVPGWAGRTVSGSYRANGVGVTLEAGRRLALDRHW 807 

Query: 552 NSLRVYLQPQAQLTYLGVNG-KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIE 610 

+++PQA+L + G ++ S + G LQ RVG QA +F L 4- + 

Sbjct: 808 FVEPQAELAWFRAGGGTYTASNGLRIEDDGGTSLQARVGAQAGRRFDLRGGAWQ 862 

Query: 611 PFAAVNALYHNKPFG-VEMDG-ERRVINNKTAIESQLGVAVKIKSHLTLQATFNRQTGKH 668 

P+A ++ + K V +G R +E LGVA + L A++ G 

Sbjct: 863 PYAQLSWVQELKGVSTVRTNGIAHRTDLGAGRVELGLGVAAALGKGHNLYASYEYAHGPR 922 

Query: 669 HQAKQGALNLQWT 681 

L+L WT 

Sbjct: 923 ---LSLPWT 92 8 



tr Q7W6 E1 Putative autotransporter [BPP2 975] [Bordetella parapertussis] 93 7 AA 

align 

Score = 5.5.5 bits (132), Expect = 3e-06 

Identities = 106/493 (21%), Positives = 170/493 (33%), Gaps = 76/493 (15%) 

Query: 224 IEVTDNSHVIGQTISLDEFRL-ENSLWEPRWDSNVGKLKTT-NADIRFNTKSE SLLV 278 

++V D + G T-SLD L W S+VG+L A + F + L 

Sbjct: 477 LDVADGAQWHGATQSLDRLALGAGGQWRMSAASSVGELSMEPGAAWFGDAAGPGFQTLT 53 6 

Query: 279 KEDYAGGARFRFAYDPKEAKNTAL I FEKNVTGTSD 1 1 FENP IDDLKS LDGHQ 1 1 KVNGTA 338 

AGFD L+ Q+P SQ + VA 

Sbjct: 537 VRTLAGAGSFEMRADAALEHADQLWTDQAEGRHRVWLRAPAGAEP SKA- -QAVLVRAPA 594 

Query: 339 D- KHAFRLSGKHQK GIYTLSLQQRPEGFLPKCKNAM IWR 376 

DK+FLG + GY L Q+P G 4- +W 

Sbjct: 595 DGKASFELDGSDGRADFGTYRYGLAQQPGGAWGLVRTGYSSTAAAALDTGGLGAVQGLWY 654 

Query: 377 FMHNRLKPPI PYCALRLNNKNSD I FDRTLPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKG 436 

N L + LRLN + R R+ 4- QG + G 
Sbjct: 655 AESNALGKRMG- -ELRLNPDAGGAWGRAFSQRQ- - -RISPRAGRHFQQGVS G 701 
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Query: 437 VQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQ 496 

++LG 4- W 4- G 4- G F 4- 4- 4-V G4- 4- A 
Sbjct: 7 02 IELGADR-AWPVAGGRWHAGGLLGYTRASRGFSGQGKGHTDSVHVGGYATYIGAN 755 

Query: 4 97 DKQTGAYADSWMQYQRFRHRINTEDGTER FTSKGITAS IEAGYNALLAEHFTKKG 551 

G YAD4- 4-4- RF + + R + + G + 4-4-EAG L H+ 
Sbjct: 756 GVYADATLRASRFENSFDVPGWAGRTVSGSYRANGVGVTLEAGRRLALDRHW 807 

Query: 552 NSLRVYLQPQAQLTYLGVNG - KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNI AIE 610 

++ + PQA4-L -f G -1-4- S + G LQ RVG QA 4-F L 4-4- 
Sbjct: 808 F VE PQAELAWFRAGGGTYTASNGLR I EDDGGTS LQARVGAQ AGRRFDLRGGAWQ 862 

Query: 611 PFAAVNALYHNKPFG-VEMDG-ERRVINNKTAIESQLGVAVKIKSHLTLQATFNRQTGKH 668 

P+A ++ + K V +G R +E LGVA 4- L A+ + G 

Sbjct: 863 PYAQLSWVQELKGVSTVRTNGIAHRTDLGAGRVELGLGVAAALGKGHNLYASYEYAHGPR 922 

Query: 669 HQAKQGALNLQWT 681 

L+L WT 

Sbjct: 923 LSLPWT 928 



tr Q7WQ8 0 Autotransporter [BB0452] [Bordetella bronchiseptica 2377 
(Alcaligenes AA 
bronchisepticus) ] align 

Score =53.9 bits (128), Expect = le-05 

Identities = 58/257 (22%), Positives = 103/257 (39%), Gaps = 29/257 (11%) 

Query : 408 KGLWLRVIDGHSNQWVQGKTAP - VEGYRKGVQLGGEVFTWQNESNQL -S IGLMGGQAEQR 465 

4-G W R + G+ AP +G+ G QLG +4-+ + + 4- G+ GG A R 

Sbjct: 2103 RGGWARTFGRRFERSAGGEAAPSFDGHLAGAQLGADLYARSSATRHTDAFGVFGGYATVR 2162 

Query: 466 STFHN PDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDG 522 

H + 4-T+4- GYT GYD++ R-hR 4- + 

Sbjct: 2163 GDVHGLARGE IQAVGTSTLRATQLGAYWT HTGPGGW Y I DTVLAGTR YRQQTKS S AQ 2218 

Query : 523 TERFTS K - G I TAS I EAG YNALLAEHFTKKGNS LRVYLQ PQAQLT YLGVNGKF SD S ENAHV 581 

+ G+TAS+EAGY L + R 4-4-PQAQ4- Y 4- 4- V 

Sbjct: 2219 VGAVSRGWGMTASVEAGYPWQL NPRWRIEPQAQWYQQLGIANGADRVSTV 2269 

Query: 582 NLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALY HNKPFGVEMDGER 632 

4- L R4-G + Q++ Y + PF V+ L+ GV + 

Sbjct: 2270 SYKTPDALTARLGTRLSGQYA-YGKAQLRPFMGVSLLHDFTGADTVTFAGAHGVRASRQN 2328 

Query: 63 3 R V I NNKT AI E S QLG VAV 64 9 

+ 4- K +4-4-QLG +V 
Sbjct: 2 32 9 TAVDLKAGVDTQLGKSV 2345 



tr Q7WHU5 Autotransporter [BB3111] [Bordetella bronchiseptica 52 8 

(Alcaligenes AA 
bronchisepticus ) ] align 

Score =53.5 bits (127), Expect = le-05 

Identities = 55/220 (25%), Positives = 89/220 (40%), Gaps = 42/220 (19%) 
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Query: 478 TGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDGTER FTSKGIT 532 

TG G YAT+ + GYD++R+H DR ++G+ 

Sbjct: 32 8 TGEGD S VH VG A Y AT Y I E DGGFYMDGIVRVARYAHEFKAPDAKGRRVRGKYRANGVG 3 83 

Query: 533 ASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLT-YLGVNGKFSDSENAHVNLLGSRQLQT 591 

AS+E G + FT G Y-f+PQ + + + G + ++ V G L 

Sbjct: 3 84 ASLELG KRFTWAG AWYVEPQLEMAVFHAQGGSYRAGQDLRVKDNGLTSLLG 434 

Query: 5 92 RVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRV INNKTAI E 642 

RVG-f QF L + + P+A + + L E DG+ V + +K+ + E 

Sbjct: 435 RVGLHVGRQFDLGDGRWQPYAKLSWLQ EFDGKNTVRTNGVRHKSRLDGGRAE 487 

Query: 643 S QLG VAVK I KS HLTL Q AT FNRQTGKHHQAKQG ALNL Q WT F 682 

LGVA + + H +L G + AK + WTF 

Sbjct: 488 LDLGVAAQLGKHGSLY GSYEYAKGSRQTMPWTF 520 



tr Q7WC76 Autotransporter [BPP0452] [Bordetella 1769 AA 

parapertussis] align 

Score =53.5 bits (127), Expect = le-05 

Identities = 58/257 (22%), Positives = 103/257 (39%), Gaps = 29/257 (11%) 

Query: 408 KGLWLRVIDGHSNQWVQGKTAP-VEGYRKGVQLGGEVFTWQNESNQL-SIGLMGGQAEQR 465 

+G W R + G+ AP +G+ G QLG ++ + + + + G+ GG A R 

Sbjct: 14 95 RGGWARTFGRRFERSAGGEAAPSFDGHLAGAQLGADLYARSSGTRHTDAFGVFGGYATVR 1554 

Query: 466 S TFHN PDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADS WMQYQRFRHR INTEDG 522 

H GYT GYD++ R+R + + 

Sbjct: 1555 GDVHGLARGE IQAVGTSTLRATQLGAYWT HTGPGGWY I DTVLAGTR YRQQTKS S AQ 1610 

Query: 523 TERFTSK-GITAS IEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENAHV 581 

+ G-f T AS+E AG Y L + R + + PQAQ+ Y + + V 

Sbjct: 1611 VGAVSRGWGMTASVEAGYPWQL NPRWR I E PQAQ WYQQLGI ANGADRVS TV 1661 

Query: 582 NLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVMALY HNKPFGVEMDGER 632 

+ L R+G + Q++ Y + PF V+ L+ GV + 

Sbjct: 1662 SYKTPDALTARLGTRLSGQYA-YGKAQLRPFMGVSLLHDFTGADTVTFAGAHGVRASRQN 1720 

Query: 633 R V I NNKT A I E S QLG VAV 64 9 

++ K +++QLG +V 
Sbjct: 1721 TAVDLKAGVDTQLGKSV 1737 



tr Q8W95 Vag8 protein (Autotransporter) [vag8-2] [Bordetella pertussis] 915 AA 

align 

Score =52.0 bits (123), Expect = 4e-05 

Identities = 61/252 (24%) , Positives = 100/252 (39%) , Gaps = 42/252 (16%) 

Query: 43 6 GVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQL 4 95 

G++LG + + S+G++ G +E R D G+V G Y ++ L 

Sbjct: 678 G I ELGLDRR VAGGATTAWS VGMLAGYS ETRR DGGAYRAGHVHSAHVGAYVSY- -L 730 
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Query: 496 QDKQTGAYADSWMQYQRFRH- -RINTED GTERFTS KG I TAS I E AGYNALL AEHFTKK 550 

D +G+Y D ++Y RFRH ITD +SG+A+G + + 
Sbjct: 731 ND--SGSYVDGWKYNRFRHGFDIRTTDLKRVDAKHRSHGLGALLRGGRRIDIDGGW 785 

Query: 551 GNS LRVYLQPQAQLT YLGVNG - KFSDS ENAHVNLLGS RQLQTRVGVQAKAQF S L YKN I A I 609 

Y++PQA + + G++S VG+ RG+AQL + 

Sbjct: 786 YVEPQASVAWFHAGGSRYEASNGLRVRADGAHSWVLRAGAEAGRQMRLANGNIV 839 

Query: 610 EPFA AWALYHNKPFGVEMDGERRVINNKTAI-ESQLGVAVKIKSHLTLQA 659 

EP+A A NA+Y N G R V ++ E+ + +GV + LA 

Sbjct: 84 0 EPYARLGWAQELGADNAVYTN G I RHVTRS RGGFAE ARVGVGALLGKRHAL YA 8 91 

Query: 660 TFNRQTGKHHQA 671 

+ G +A 
Sbjct: 892 DYEYAKGARFEA 903 



tr 066044 Vag8 [vag-8] [Bordetella pertussis] 915 AA 

align 

Score =52.0 bits (123), Expect = 4e-05 

Identities = 61/252 (24%) , Positives = 100/252 (39%) , Gaps = 42/252 (16%) 

Query: 436 GVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQL 495 

G++LG + + S+G+ + G +E R D G+V G Y ++ L 

Sbjct: 678 G I ELGLDRR VAGGATTAWS VGML AG YS ETRR DGGAYRAGHVHSAHVGAYVSY- -L 73 0 

Query: 496 QDKQTGAYADSWMQYQRFRH- -RINTED GTERFTSKGITASIEAGYNALLAEHFTKK 550 

D +G+Y D -f+Y RFRH ITD +SG+A+G + + 
Sbjct: 731 ND--SGSYVDGWKYNRFRHGFDIRTTDLKRVDAKHRSHGLGALLRGGRRIDIDGGW 785 

Query: 551 GNS LRVYLQPQAQLT YLGVNG - KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAI 609 

Y++PQA Q++S V G+ ■ R G +A Q L + 

Sbjct: 786 YVEPQASVAWFHAGGSRYEASNGLRVRADGAHSWVLRAGAEAGRQMRLANGNIV 839 

Query: 610 EPFA AVNALYHNKPFGVEMDGERRVINNKTAI -ESQLGVAVKIKSHLTLQA 659 

EP+A A NA+Y N G R V ++ E + 4-4-GV + LA 

Sbjct: 840 EPYARLGWAQELGADNAVYTN G I RHVTRS RGGFAE ARVGVGALLGKRHAL YA 891 

Query: 660 TFNRQTGKHHQA 671 

+ G +A 
Sbjct: 892 DYEYAKGARFEA 903 



tr Q7WQ83 Autotransporter [BB0450] [Bordetella bronchiseptica 2152 
(Alcaligenes AA 
bronchisepticus) ] align 

Score = 51.6 bits (122), Expect = 5e-05 

Identities = 47/220 (21%), Positives = 88/220 (39%), Gaps = 23/220 (10%) 

Query : 407 RKGLWLRVIDGHSNQWVQGKTAP - VEGYRKGVQLGGEVFTWQNESNQLS I -GLMGGQAEQ 464 

R+G W R + G P +G+ G+Q G +++ +++ + G+ GG A 

Sbjct: 1876 RRGAWTRAFGRTFERSGSGDVDPRFDGHVAGLQAGVDLYARRSDQGHADLAGVFGGYANA 1935 
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Query: 465 RSTFHNPDTDNLTTGNVKGFGA GVYATWHQLQDKQTGAYADSWMQYQRFRHRI -N 518 

R D G + Y + . -fG Y D+ + R+ + + 

Sbjct: 1936 RGHM DGFARGETGAYAGKPDLNAY YI GG YWTH I GPS GW YVDAVL AGTR YEQKAKS 1990 

Query : 519 TEDGTERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSEN 57 8 

+ D G+TAS+EAGY + E + +++PQAQL Y + + + 

Sbjct: 1991 SNDLRTEAKGWGVTASVEAGYPVPIGEKW HIE PQAQL VYQRLTVSNGEDD V 2041 

Query: 579 AHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNAL 618 

+ V+ + R+G + Q++ Y + PF V+ L 

Sbjct: 2042 SSVSYRTPDSVTARLGARLSGQYA-YNTTQLRPFMEVSLL 2080 



tr Q7WC79 Autotransporter [BPP0449] [Bordetella 1616 AA 

parapertussis] align 

Score = 51.6 bits (122), Expect = 5e-05 

Identities = 47/220 (21%), Positives = 88/220 (39%), Gaps = 23/220 (10%) 

Query: 407 RKGLWLRVIDGHSNQWVQGKTAP - VEGYRKGVQLGGEVFTWQNESNQLSI -GLMGGQAEQ 464 

R+G W R + G P +G+ G+Q G + + + + + + + G+ GG A 

Sbjct: 1340 RRGAWTRAFGRTFERSGSGDVDPRFDGHVAGLQAGVDLYARRSDQGHADLAGVFGGYANA 1399 

Query: 465 RSTFHNPDTDNLTTGNVKGFGA GVYATWHQLQDKQTGAYADSWMQYQRFRHRI -N 518 

R D G + Y + +GYD++ R+ + + 

Sbjct: 1400 RGHM DGFARGETGAYAGKPDLNAYY I GG YWTH I GPS GWYVDAVLAGTR YEQKAKS 1454 

Query: 519 TEDGTERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSEN 578 

+ D G+TAS+EAGY + E + " +++PQAQL Y + + + 

Sbjct: 1455 SNDLRTEAKGWGVTASVEAGYPVPIGEKW HIE PQAQL VYQRLTVSNGEDD V 1505 

Query: 579 AHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNAL 618 

+ V+ + R+G + Q++ Y + PF V+ L 

Sbjct: 1506 SSVSYRTPDSVTARLGARLSGQYA- YNTTQLRPFMEVSLL 1544 



tr Q7W9Y1 Autotransporter [BPP1618] [Bordetella parapertussis] 519 AA 

align 

Score = 51.6 bits (122), Expect = 5e-05 

Identities = 57/243 (23%), Positives = 92/243 (37%), Gaps = 47/243 (19%) 

Query: 455 IGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFR 514 

+G + G R F D TG+ G YAT+ + G Y D ++ R R 

Sbjct: 3 01 VGAVAG YTNGR I KF DRGGTGDDD S VHVGAYAT YI E DGGF YMDG I VRVS R I R 351 

Query: 515 HRINTEDGTER FTS KG I TAS I EAG YNALLAEHFTKKGNS LRVYLQPQAQLT YLGV 569 

H +D R + G+ AS+E G + FT G Y-+ + PQ + + 

Sbjct: 352 HAFKVDDAKGRRVRGQYRGNGVGASLELG KRFTWPG AWYVEPQLEVAAFRA 402 

Query: 570 NG-KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEM 628 

G ++ S + G+ + R+G+ QF L ++ p + ++ + e 
Sbjct: 403 QGADYTASNGLRIKDDGTNSMLGRLGLHVGRQFDLGDGRWQPYMKLSWVQ EF 455 
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Query: 62 9 DGERRVINNKT AIESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQ 679 

DG+ V N E LGVA ++ H +L G + AK + 

Sbjct: 456 DGKGTVRTNGVRHKNRLDGGRAELDLGVAAQLGKHGSLY GSYEYAKGSRQTMP 508 

Query: 680 WTF 682 
WTF 

Sbjct: 509, WTF 511 



tr Q9F290 YapC protein (Putative autotransporter) (Putaive 63 8 

autotransporter AA 
protein) [yapC] [Yersinia pestis] align 

Score = 51.2 bits (121), Expect = 6e-05 

Identities = 39/205 (19%) , Positives = 90/205 (43%) , Gaps = 22/205 (10%) 

Query: 409 GLWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTF 468 

G+W R + S + K + G+ + +G + N N L + +' 

Sbjct: 382 GVWARYLTDDS-RLSDNKNIAFKNTLSGMEIGADKQLGLNRGNMLIGAFTSYSSSDVKST 440 

Query: 469 HNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDGTERFTS 528 

H+ + G+ + + +G G+Y T+ Q+G Y D+ ++ RF +++NT++ + 

Sbjct: 441 HDAN GD I RS YGGGLYLTYLD QSGFYVDTVLKANRFNNKMNTQETRGEYNQ 490 

Query: 52 9 KGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGV-NGKFSDSENAHVNLLGSR 587 

+T S+E+GY + + + L+P +++Y + + ++ S + + 

Sbjct: 491 NALTTSVESGYQWPVYAN LVLEPYGKVSYSRIGSADYTLSNGMVAEVAKAD 541 

Query: 588 QLQTRVGVQAKAQFSLYKNIAIEPF 612 

+Q +G A +S + + I+P+ 
Sbjct: 542 SVQGELGTVLAASYSI -NQMTIKPY 565 



tr Q88ID0 Outer membrane autotransporter [PP3069] [Pseudomonas putida 825 
(strain AA 
KT2440)] t align 

Score =50.8 bits (120), Expect = 8e-05 

Identities = 57/222 (25%) , Positives = 97/222 (43%) , Gaps = 36/222 (16%) 

Query: 477 TTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDG-TERFTSKGITASI 535 

+ G++K G V W + + GAY D+ +QY R R +E G T TAS+ 
Sbjct: 624 SVGDLKLDGDSVGTYWTLVGPQ- -GAYLDAVLQYTRLDGRARSERGDTLNLDGHAWTASL 681 

Query: 53 6 EAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENAHVNLLGSRQLQTRVGV 5 95 

E+GY L+E R + +-PQAQL V + + + ++ +L R+G+ 

Sbjct: 682 ESGYPITLSE RWRVEPQAQLIAQKVALESARDSVSRISHDAQVELTGRLGL 732 

Query: 596 QAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVIN NKTAIESQ 644 

+ + F+ ++PFA VN L+H DG R + T+++ + 

Sbjct: 733 RLEGAFTGS SGRLLQPFAQVN - LWHG DGGRDTLTFDDADKI KTD YRYTS VQLE 784 

Query: 645 LGVAVKIKSHLTL QATFNRQTGKHHQAKQGALNLQWTF 682 

GV + + L+L Q T N + + +A L ++W F 
Sbjct: 785 SGWAQVNEALSLHGGVQYTANLDS ~ RQQEASGVNLGVRWQF 825 
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sp P76017 Hypothetical protein ycgV [ycgV] [Escherichia 955 AA 

YCGV_ECOLI coli] align 

Score =50.1 bits (118), Expect = le-04 

Identities = 58/218 (26%), Positives = 94/218 (42%), Gaps = 32/218 (14%) 

Query: 410 LWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFH 469 

+WLR G + + GK + -f G-fQ GG+- L +GL G ST 
Sbjct: 694 I WLRS YGGSLDSFASGKLSGFDMGYSGIQFGGD - - KRLSDVMPLYVGLYIG --STHA 746 

Query: 470 NPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDG TE 524 

+PD G + G+YA+ + Q G Y+D -f-f R ++ + D 

Sbjct: 747 S PDYSG - GDGTARSDYMGMYAS YM AQNG F YS DL V I KAS RQ KNS FH VLD S QNNG VNA 801 

Query: 525 RFTS KG I TAS I EAG YNALLAEHFTKKGNS LRVYLQPQAQLT YLGVNG KFSDSENAHV 581 

T+ G++ S+EAG + F Y+ + PQ QLTY N K S+ N H+ 

Sbjct : 8 02 NGTANGMS I S LE AG QRFNLSPTGYGFYIEPQTQLTYSHQNEMTMKASNGLNIHL 855 

Query: 582 N LLGSRQLQTRVGVQA-KAQFSLY-KNIAIEPFA 613 

N LLG + + A +Q + + Y K AI F+ 

Sbjct: 856 NH YE S LLGRASMI LGYD I TAGNSQLNVYVKTGAI RE FS 893 



tr Q881W9 Autotransporter, putative [PSPT02763] [Pseudomonas syringae 927 
(pv. AA 
tomato)] align 

Score = 49.7 bits (117), Expect = 2e-04 

Identities = 69/290 (23%), Positives = 112/290 (37%), Gaps = 39/290 (13%) 

Query: 411 WLRVIDGHSNQWVQGKTAP - VEGYRKGVQLGGEVFTWQNESNQ - LS IGLMGGQAEQ 464 

■ W RV Q Q +P ++ " KG Q+G +V+ W + Q L IGL Q 

Sbjct: 657 WARVFGSDFKQQWSGTVSPGLDASLKGYQIGHDVYAWSLDGQQILRIGLFVAQNRLDGKV 716 

Query: 465 RSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTED 521 

FH TG +K G V A W T +Y D+ + R + + 

Sbjct: 717 QGFAGGFHARH TGRI KLHGDS VGAYW - - TLS S PTAS YVDALVMSTRLDGYSRSDR 769 

Query: 522 GTERFTSKG- - ITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENA 579 

G R ++G ++ S+EAG+ +L R +PQ Q+ + ++ + 

Sbjct: 770 GL-RIDTQGHALSLSVEAGHPFVLTP RWVAEPQVQIIHQRIDLDDQHDGIS 819 

Query: 5 80 HVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNK- -PFGVEMDGERRVINN 637 

HV R+G++ K +++L + IEP+ N L+ N V D R+ 

Sbjct: 820 HVGFDSQPYNTGRLGIRFKGRYAL-AGMPIEPYLRAN-LWRNAGGHDTVTFDHTERIKTA 877 

Query: 638 KTAIESQLGVAVKIK SHLTLQATFNRQTGKHHQAKQGA LNLQW 680 

+ LG + IK +4- A 4-NR H + A + L W 

Sb j c t : 87 8 HRSTTGSLGAGMVIKVASDTSVYWGADYNRDLNSHDSSGANASLGVRLAW 92 7 
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tr Q7WL85 Autotransporter [vag8] [Bordetella bronchiseptica (Alcaligenes 915 AA 
bronchisepticus) ] 

align 

Score = 49.7 bits (117), Expect = 2e-04 

Identities = 59/252 (23%), Positives = 100/252 (39%), Gaps = 42/252 (16%) 

Query: 436 GVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQL 495 

G++LG + + S+G++ G +E R D G+V G Y + + L 

Sbjct: 67 8 G I ELG IDRR VAGGATTAWS VGMLAG YS ETRR DGGAYRAGHVHSAHVGAYVSY- -L 73 0 

Query: 496 QDKQTGAYADSWMQYQRFRH- -RINTED GTERFTSKGITASIEAGYNALLAEHFTKK 550 

D +G+Y D ++Y RFRH +TD +SG+A+G + + 

Sbjct: 731 ND--SGSYvT)GWKYNRFRHGFDvT^TTDLKRvT)AKHRSHGLGALLRGGRRIDIDGGW--- 785 

Query: 551 GNSLRVYLQPQAQLTYLGVNG - KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNI AI 60 9 

Y++PQA + + +G ++ S V + R G +A Q L + 

Sbjct: 786 YVEPQASVAWFHADGSRYEASNGLRVRADSAHSWVLRAGAEAGRQMRLANGNIV 83 9 

Query: 610 EPFA AVNALYHNKPFGVEMDGERRVINNKTAI-ESQLGVAVKIKSHLTLQA 659 

EP+A A NA+Y N G R V ++ E+ + +GV + LA 

Sbjct: 840 EPYARLGWAQELGADNAVYTN G I RH VTRS RGG F AE AR VG VG ALLG KRHAL YA 891 

Query: 660 TFNRQTGKHHQA 671 

+ G +A 
Sbjct: 892 DYEYAKGARFEA 903 



tr Q7W7U5 Autotransporter [vag8] [Bordetella parapertussis] 915 AA 

align 

Score = 49.7 bits (117), Expect = 2e-04 

Identities = 59/252 (23%) , Positives = 100/252 (39%) , Gaps = 42/252 (16%) 

Query: 43 6 GVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQL 4 95 

G++LG + + S+G++ G +E R D G+V G Y ++ L 

Sb j ct : 67 8 G I ELG IDRRVAGGATTAWS VGMLAG YS ETRR DGGAYRAGHVHSAHVGAYVSY- -L 73 0 

Query: 496 QDKQTGAYADSWMQYQRFRH- -RINTED GTERFTSKGITASIEAGYNALLAEHFTKK 550 

D +G+Y D ++Y RFRH +TD +SG+A+G + + 
Sbjct: 731 ND - - SGS YVDGWKYNRFRHGFDVRTTDLKRVDAKHRSHGLGALLRGGRRIDIDGGW 785 

Query: 551 GNSLRVYLQPQAQLTYLGVNG -KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAI 60 9 

Y++PQA +++G++S V + RG+AQL + 

Sbjct: 7 86 YVEPQASVAWFHADGSRYEASNGLRVRADSAHSWVLRAGAEAGRQMRLANGNIV 83 9 

Query: 610 EPFA AVNALYHNKPFGVEMDGERRVINNKTAI -ESQLGVAVKIKSHLTLQA 65 9 

EP+A A NA+Y N G R V ++ E+ ++GV + LA 

Sbjct: 840 EPYARLGWAQELGADNAVYTN G IRHVTRS RGGFAEARVGVGALLGKRHAL YA 891 

Query: 660 TFNRQTGKHHQA 671 

+ G +A 
Sbjct: 892 DYEYAKGARFEA 903 
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tr Q7WHU6 Autotransporter [BB3110] [Bordetella bronchiseptica 770 
(Alcaligenes AA 
bronchisepticus) ] align 

Score = 49.3 bits (116), Expect = 2e-04 

Identities = 57/243 (23%), Positives = 95/243 (38%), Gaps = 47/243 (19%) 

Query: 455 IGLMGGQAEQRSTFHNPDTDNLTTGWKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFR 514 

+G 4- G R F D TG+ G YAT+ + G Y D ++ R R 

Sbjct: 552 VGAVAG YTNGR I KF DRGGTGDDD S VHVGAYAT Y I E DGGFYMDGIVRVSRIR 602 

Query: 515 HRINTEDGTER FTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGV 569 

H 4D R 4 G+ AS+E G + FT G Y++PQ + + 

Sbjct: 603 HAFKVDDAKGRRVRGQYRGNGVGASLELG KRFTWPG AWYVEPQLEVAAFHA 653 

Query: 570 NG-KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEM 628 

G ++ S + G+ + R+G+ QF L ++P+ ++ + E 
Sbjct: 654 QGADYTASNGLRIKDDGTNSMLGRLGLHVGRQFDLGDGRWQPYMKLSWVQ EF 706 

Query: 62 9 DGERRV INNKTAI ESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQ 67 9 

, DG+ V I +K + E +GVA + + H +L G + AK + 

Sbjct: 707 DGKGTVRTNDIRHKVRLDGGRAELAVGVASQLGKHGSL FGSYEYAKGSRQTMP 759 

Query: 68 0 WTF 682 

WTF 

Sbjct: 760 WTF 762 



tr 086135 Tracheal colonization factor precursor [tcfA2] [Bordetella 647 AA 
pertussis] 

align 

Score = 49.3 bits (116), Expect = 2e-04 

Identities = 57/243 (23%), Positives = 95/243 (38%), Gaps = 47/243 (19%) 

Query: 455 IGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFR 514 

+G + G R F D TG+ G YAT+ + G Y D 44 R R 

Sbjct: 429 VGAVAGYTNGRIKF DRGGTGDDD S VH VG AYAT Y I E DGGFYMDGIVRVSRIR 479 

Query: 515 HRINTEDGTER FTS KG I TAS I EAG YNALLAEHFTKKGNS LRVYLQPQAQLT YLG V 569 

H +D R + G+ AS + E G + FT G Y++PQ 4- + 

Sbjct: 480 HAFKVDDAKGRRVRGQYRGNGVGASLELG KRFTWPG AWYVEPQLEVAAFHA 530 

Query: 570 NG-KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEM 628 

G ++ S + G+ + R+G+ QF L ++P+ ++ + E 
Sbjct: 531 QGADYTASNGLRIKDDGTNSMLGRLGLHVGRQFDLGDGRWQPYMKLSWVQ EF 583 

Query: 629 DGERRV INNKTAI ESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQ 679 

DG+ V I +K + E +GVA 4-4- H +L G + AK + 

Sbjct: 584 DGKGTVRTNDIRHKVRLDGGRTELAVGVASQLGKHGSL FGSYEYAKGSRQTMP 63 6 

Query: 680 WTF 682 

WTF 

Sbjct: 637 WTF 639 
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Score = 49.3 bits (116), Expect = 2e-04 

Identities = 57/243 (23%), Positives = 95/243 (38%), Gaps = 47/243 (19%) 

Query: 4 55 IGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFR 514 

+G + G R F D TG+ G YAT+ + G Y D ++ R R 

Sbjct: 429 VGAVAG YTNGR I KF DRGGTGDDD S VH VG A YAT Y I E DGGFYMDGIVRVSRIR 479 

Query: 515 HRINTEDGTER FTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGV 569 

H +D R + G+ AS+E G 4- FT G Y+ + PQ + + 

Sbjct: 480 HAFKVDDAKGRRVRGQYRGNGVGASLELG KRFTWPG AWYVEPQLEVAAFHA 530 

Query: 570 NG-KFSDSENAHWLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEM 628 

G + + S + G+ + R+G+ QF L ++P+ + + + E 
Sbjct: 531 QGADYTASNGLRIKDDGTNSMLGRLGLHVGRQFDLGDGRWQPYMKLSWVQ EF 583 

Query: 62 9 DGERRV INNKTAI ESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQ 67 9 

DG+ V J +K + E +GVA + + H +L G + AK + 

Sbjct: 584 DGKGTVRTNDIRHKVRLDGGRTELAVGVASQLGKHGSL FGSYEYAKGSRQTMP 636 

Query: 68 0 WTF 682 

WTF 

Sbjct: 637 WTF 639 



tr Q9F4B3 BapB protein [bapB] [Bordetella pertussis] 4 82 AA 

align 

Score = 49.3 bits (116), Expect = 2e-04 

Identities = 57/243 (23%), Positives = 95/243 (38%), Gaps = 47/243 (19%) 

Query: 455 IGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFR 514 

+G + G R F D TG+ G YAT+ + G Y D ++ R R 

Sbjct: 264 VGAVAG YTNGR I KF DRGGTGDDD SVHVGAYATYIE DGGFYMDGIVRVSRIR 314 

Query: 515 HRINTEDGTER FTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGV 569 

H +D R + G+ AS+E G + FT G Y-f + PQ + + 

Sbjct: 315 HAFKVDDAKGRRVRGQYRGNGVGASLELG KRFTWPG AWYVEPQLEVAAFHA 365 

Query: 570 NG - KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNI AIEPFAAVNALYHNKPFGVEM 628 

G ++ S + G+ + R+G+ QF L ++P+ + . e 
Sbjct: 366 QGADYTASNGLRIKDDGTNSMLGRLGLHVGRQFDLGDGRWQPYMKLSWVQ ---EF 418 

Query: 62 9 DGERRV INNKTAI ESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQ 67 9 

DG+ V I +K + E +GVA ++ H +L G + AK + 

Sbjct: 419 DGKGT VRTND I RHKVRLDGGRTEL AVGVAS QLGKHGS L FGSYEYAKGSRQTMP 471 

Query: 680 WTF 682 
WTF 

Sbjct: 472 WTF 474 



tr Q8GB90 Tracheal colonization factor [tcfA] [Bordetella pertussis] 647 AA 
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align 

Score = 49.3 bits (116), Expect = 2e-04 

Identities = 57/243 (23%), Positives = 95/243 (38%), Gaps = 47/243 (19%) 

Query: 455 IGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFR 514 

+G 4- G R F D TG+ G YAT+ + G Y D ++ R R 

Sbjct: 429 VGAVAG YTNGR I KF DRGGTGDDDSVHVGAYATYIE DGGFYMDGIVRVSRIR 479 

Query: 515 HRINTEDGTER FTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGV 569 

H +D R + G+ AS + E G + FT G Y-f + PQ + + 

Sbjct: 480 HAFKVDDAKGRRVRGQYRGNGVGASLELG KRFTWPG AWYVEPQLEVAAFHA 530 

Query: 57 0 NG-KFSDSENAHWLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEM 62 8 

G ++ S + G+ + R+G+ QF L ++P+ + + + E 
Sbjct: 531 QGADYTASNGLRIKDDGTNSMLGRLGLHVGRQFDLGDGRWQPYMKLSWVQ EF 583 

Query: 62 9 DGERRV INNKTAI ESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQ 67 9 

DG+ V I +K 4- E +GVA ++ H +L G + AK + 

Sbjct: 584 DGKGTVRTNDIRHKVRLDGGRTELAVGVASQLGKHGSL FGSYEYAKGSRQTMP 636 

Query: 680 WTF 682 

WTF 

Sbjct: 637 WTF 639 



tr Q45343 Tracheal colonization factor [tcfA] [Bordetella pertussis] 672 AA 

align 

Score =49.3 bits (116), Expect = 2e-04 

Identities = 57/243 (23%), Positives = 95/243 (38%), Gaps = 47/243 (19%) 

Query: 455 IGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFR 514 

+G + G R F D TG+ G YAT+ + G Y D ++ R R 

Sbjct: 454 VGAVAG YTNGR I KF DRGGTGDDDSVHVGAYATYIE DGGFYMDGIVRVSRIR 504 , 

Query: 515 HRINTEDGTER FTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGV 569 

H +D R + G+ AS+E G + FT G Y++PQ ++ 

Sbjct: 505 HAFKVDDAKGRRVRGQYRGNGVGASLELG- KRFTWPG AWYVEPQLEVAAFHA 555 

Query: 570 NG-KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEM 62 8 

G ++ S + G+ + R+G+ QF L ++P+ ++ + E 
Sbjct: 556 QGADYTASNGLRIKDDGTNSMLGRLGLHVGRQFDLGDGRWQPYMKLSWVQ EF 608 

Query: 629 DGERRV INNKTAI ESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQ 679 

DG+ V I +K + E +GVA ++ H +L G + AK + 

Sbjct: 609 DGKGTVRTND I RHKVRLDGGRTEL AVGVAS QLGKHGS L FGSYEYAKGSRQTMP 661 

■\ 

Query: 680 WTF 682 

WTF 

Sbjct: 662 WTF 664 



http://us.expasy.org/cgi-bin/blast.pl 



9/14/04 



ExPASy BLAST2 Interface 



Page 47 of 62 



tr Q6U948 Tracheal colonization factor protein [tcfA] [Bordetella 642 
pertussis] AA 

align 

Score = 49.3 bits (116), Expect = 2e-04 

Identities = 57/243 (23%), Positives = 95/243 (38%), Gaps = 47/243 (19%) 

Query: 4 55 IGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFR 514 

+G + G R F D TG+ G YAT+ + G Y D ++ R R 

Sbjct: 424 VGAVAG YTNGR I KF DRGGTGDDDSVHVGAYATYIE DGGF YMDG I VRVS R I R 474 

Query: 515 HRINTEDGTER FTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGV 569 

H +D R + G+ AS+E G + FT G Y+4-PQ 4-+ 

Sbjct: 475 HAFKVDDAKGRRVRGQYRGNGVGASLELG KRFTWPG AWYVEPQLEVAAFHA 525 

Query: 570 NG-KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEM 628 

G ++ S + G+ + R+G+ QF L ++P+ ++ + E 
Sbjct: 526 QGADYTASNGLRIKDDGTNSMLGRLGLHVGRQFDLGDGRWQPYMKLSWVQ EF 578 

Query: 62 9 DGERRV INNKTAI ES QLGVAVKI KSHLTLQATFNRQTGKHHQAKQGALNLQ 67 9 

DG+ V I +K + E +GVA ++ H +L G + AK + 

Sbjct: 579 DGKGTVRTND IRHKVRLDGGRTELAVGVAS QLGKHGSL FGSYEYAKGSRQTMP 631 

Query: 680 WTF 682 

WTF 

Sbjct: 632 WTF 634 



tr Q7WKS2 Putative autotransporter [bapC] [Bordetella bronchiseptica 993 AA 
(Alcaligenes bronchisepticus) ] 

align 

Score = 48.1 bits (113), Expect = 5e-04 

Identities = 96/486 (19%) , Positives = 174/486 (35%) , Gaps = 80/486 (16%) 

Query: 234 GQT I S LDE FRLENS LWEPRWD SNVGKLKTTNAD IRFNTKS E S LLVKEDYAGGAR 287 

G T +++ R+E+ W S V L + + T +ES L VK +G 

Sbjct: 543 GITMTVNALRIEDGTWTVTGSSAVNSLHLQAGKVAYATPAESDGKFKHLRVKT-LSGSGL 601 , 

Query: 288 FRFAYDPKEAKNTALIFEKNVTGTSDIIFENP IDDLKSLD GHQIIKV 334 

F + L+ +G ++ ++ L ++ G + 

Sbjct: 602 FEMNASADLSDGDLLWSDEASGQHKVLVRGAGTEPTGVESLTLVELPKGSQAGFTLANR 661 

Query: 3 35 NGTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKN — -AMIWRFMHNRLKPPIP 3 87 

G D AFR G++ L + N +IWNL + 

Sbjct: 662 GGVVDAGAFRYRLTPDNGVWGLERTSQLSAVANAALNTGGVGAASSIWYAEGNALSKRLG 721 

Query: 388 YCALRLNMKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQ- -LGGEVFT 445 

LRL+ + RT +K + +D + + K V G+ G + G+ 
Sbjct: 722 - -ELRLDPGAGGFWGRTFAQK QQLDNKAGRRFDQK VYGFELGADHAIAGQQGR 772 

Query: 44 6 WQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYAD 505 

W +G+G R+F+ +++++AV GYD 
Sbjct: 773 WH VGGLLG YTRARRS F VDDGAGHTD S AH IGAYAAYV ADNGFYFD 816 

Query: 506 SWMQYQRFRH- -RINTEDGTE RFTS KG I TAS I EAG YNALL AEHFTKKGNS LRVYLQP 560 

S ++ RF + + D ++ + G+ A++EAG L + + ++^P 
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Sbjct: 817 STLRASRFENDFTVTATDAVSIRGKYRANGVGATLEAGKRFTLHDGW FVEP 867 

QAQLTYLGVNG - KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNI AIEPFAAVNALY 619 

Q+ + + + +G + + N V G R+G+ A + L K+ I + P+-A ++ L 



EG VM + + LG ++LLA R +AK L 

- EFKGVTTVRTNGYGLRTDLG-GGRAELALGLAAALGRGHKFYTS YEYAKGNKL 979 



Sbjct : 


817 


Query : 


561 


Sbjct : 


868 


Query : 


620 


Sbjct: 


928 


Query : 


677 


Sbjct: 


980 



L WTF 



tr Q83RP3 Putative adhesion and penetration protein [SF1205] [Shigella 955 AA 
f lexneri] 

align 

Score =47.4 bits (111), Expect = 0.001 

Identities = 57/218 (26%), Positives = 93/218 (42%), Gaps = 32/218 (14%) 

Query: 410 LWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFH 469 

+WLR G + + GK + + G+Q GG+ ++ L +GL ST 
Sbjct: 694 IWLRSYGGSLDSFASGKLSGFDMGYSGIQFGGD- -KRLSDVMPLYVGLY IDSTHA 746 

Query: 470 NPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDG TE 524 

+PD G + G+YA++ Q G Y+D ++ R + + + D 

Sbjct: 747 S PDYSG - GDGTARSD YMGMYAS YM AQNGFYSDLVI KASRQKNS FHVLDSQNNGVNA 801 

Query: 52 5 RFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNG KFSDSENAHV 581 

T+ G+4- S+EAG + F Y+ + PQ QLTY N K S+ N H+ 

Sbjct: 802 NGTANGMSISLEAG QRFNLSPTGYGFYIEPQTQLTYSHQNEMAMKASNGLNIHL 855 

Query: 582 N LLGSRQLQTRVGVQA-KAQFSLY-KNIAIEPFA 613 

N LLG + + A +Q ++Y K AI F+ 

Sbjct: 856 NH YE S L LGRAS M I LG YD I T AGNS QLNV Y VKTGA I RE F S 893 



tr Q7WK68 Autotransporter [BB2270] [Bordetella bronchiseptica . 559 
(Alcaligenes 

bronchisepticus) ] align 
Score = 47.4 bits (111), Expect = 0.001 

Identities = 50/219 (22%), Positives = 85/219 (37%), Gaps = 42/219 (19%) 

Query: 479 GNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDG TERFTSKGITA 533 

GN G YAT+ G Y D ++ R+ H + T ++ + GI 

Sbjct: 360 GNSDSLHVGAYATYIG DGGFYLDGIVRVNRYEHDFKADGQRGARVTGKYRANGIGL 415 

Query: 534 SIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGK-FSDSENAHVNLLGSRQLQTR 592 

S+E G FT G+ +++PQ ++ Q +4- S V++ ++ L R 

Sbjct: 416 SLETG RRFTWAGDW FVEPQVEVALFRSGGADYTASNGVRVDVASTKSLLGR 466 

Query: 593 VGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTA IES 64 3 
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G+Q 4 L + + P+A 44 L E DG +V N E 

Sbjct: 467 AGLQVGRKLDLGNGKLVQPYAKLSWLQ EFDGVGKVRTNDIGHDVKLRGGRAEL 519 

Query: 644 QLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQWTF 682 

LGVA + H +L A++ G L 4 W+F 

Sbjct: 520 DLGVAAALGRHSSLFASYEYSKGSR LTIPWSF 551 



tr Q7WD6 Autotransporter (Pseudogene) [bapC] [Bordetella pertussis] 993 AA 

align 

Score = 47.4 bits (111) , Expect = 0.001 

Identities = 93/486 (19%) , Positives = 173/486 (35%) , Gaps = 80/486 (16%) 

Query: 234 GQT I S LDE FRLENS LWEPRWD SNVGKLKTTNAD I RFNTKS E S - - LLVKED YAGGAR 287 

G T +44 R+E+ W S V L + + T 4-ES L VK +G 

Sbjct: 543 GMTKTWALRIEDGTWTVTGSSTVNSLHLQAGKVAYATPAESDGEFKHLRVK-TLSGSGL 601 

Query: 288 FRFAYDPKEAKNTALIFEKNVTGTSDIIFENPIDDLKSLDGHQIIKV 334 

F + L+ +G 44 + 44 +++ + 

Sbjct: 602 FEMNASADLSDGDLLWSDEASGQHKVLVRGAGTEPTGVESLTLVELPEGSQTKFTLANR 661 

Query: 33 5 NGTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKN AMIWRFMHNRLKPPIP 3 87 

G D AFR G44 L 4 N + IW ' N L + 

Sbjct: 662 GGWDAGAFRYRLTPDNGVWGLERTSQLSAVANAALNTGGVGAASSIWYAEGNALSKRLG 721 

Query: 388 YCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPVEGYRJCGVQ- -LGGEVFT 445 

LRL+ + RT +K + +D + + K V G+ G " + G+ 
Sbjct: 722 - -ELRLDPGAGGFWGRTFAQK QQLDNKAGRRFDQK- - -VYGFELGADHAIAGQQGR 772 

Query: 44 6 WQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYAD 505 

W +G + G R+F+ + +++ +*AV GYD 
Sbjct: 773 WH VGGLLGYTRARRSFIDDGAGHTDSAHIGAYAAYV ADNGFYFD 816 

Query: 506 SWMQYQRFRH- -RINTEDGTE RFTS KG I TAS I E AGYNALLAEHFTKKGNS LRVYLQP 560 

S 44 RF 4 4- D 44- 4 G4 A44EAG L 4 4 444P 

Sbjct: 817 STLRASRFENDFTVTATDAVSVRGKYRANGVGATLEAGKRFTLHDGW FVEP 867 

Query: 561 QAQLT- YLGVNGKFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALY 619 

Q4444 4 G + 4 N V G R4G4 A 4- L K+ I4P4-A 44 L 

Sbjct: 868 QSEVSLFHASGGTYRAANNLSVKDEGGTSAVLRLGLAAGRRIDLGKDRVIQPYATLSWLQ 927 

Query: 62 0 HNKPFGVEMDGERRVINNKTAIESQLGVAVKIKSHLTLQATFNR QTGKHHQAKQGAL 676 

EG VN +4L 4+LLA R +AK L 

Sbjct: 92 8 EFKGVTTVRTNGYGLRTDLS -GGRAELALGLAAALGRGHQLYTSYEYAKGNKL 979 

Query: 677 NLQWTF 682 

L WTF 

Sbjct: 980 TLPWTF 985 



tr Q7C1Y6 Putative adhesion and penetration protein [S1289] [Shigella 955 AA 
f lexneri] 

alig n 
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tr O86044 Putative autotransporter (BapC protein) [bapC] [Bordetella 759 AA 
pertussis] 

align 

Score = 47.4 bits (111), Expect = 0.001 

Identities = 93/486 (19%) , Positives = 173/486 (35%) , Gaps = 80/486 (16%) 

Query: 234 GQTISLDEFRLENSLWEPRWDSNVGKLKTTNADIRFNTKSES LLVKEDYAGGAR 287 

G T +++ R+E+ W S V L + + T +ES L VK +G 

Sbjct: 309 GMTKTVNALR I EDGTWTVTGS S TVNSLHLQAGKVAYATPAE SDGEFKHLRVK - TL SGSGL 367 

Query: 288 FRFAYDPKEAKNTALIFEKNVTGTSDI IFENPIDDLKSLDGHQI IKV 334 

F + L+ +G - ++ + ++ ++ + + 

Sbjct: 368 FEMNASADLSDGDLLWSDEASGQHKVLVRGAGTEPTGVESLTLVELPEGSQTKFTLANR 427 

Query: 335 NGTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKN AMIWRFMHNRLKPPIP 387 

G D AFR G++ L+ N +IWNL + 

Sbjct: 42 8 GGVVDAGAFRYRLTPDNGVWGLERTSQLSAVANAALNTGGVGAAS S I WYAEGNALS KRLG 487 

Query: 3 88 YCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQ- -LGGEVFT 445 

LRL+ + RT +K + +D + + K V G+ G + G+ 
Sbjct: 488 - -ELRLDPGAGGFWGRTFAQK QQLDNKAGRRFDQK VYGFELGADHAIAGQQGR 53 8 

Query: 446 WQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYAD 505 

W +G + G R + F + + + ++ + A V GYD 
Sbjct: 539 WH VGGLLGYTRARRSFIDDGAGHTDSAHIGAYAAYV ADNGFYFD 582 

Query: 506 SWMQYQRFRH- -RINTEDGTE RFTS KG I TAS I E AG YNALLAEHFTKKGNS LRVYLQP 560 

S ++ RF + + D + + + G+ A++EAG L + + +++P 
Sbjct: 583 STLRASRFENDFTVTATDAVSVRGKYRANGVGATLEAGKRFTLHDGW FVEP 633 

Query: 561 QAQLT- YLGVNGKFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALY 619 

Q++++ + G + + N V G R+G+ A + L K+ I+P+A ++ L 

Sbjct: 634 QSEVSLFHASGGTYRAANNLSVKDEGGTSAVLRLGLAAGRRIDLGKDRVIQPYATLSWLQ 693 

Query: 62 0 HNKP FG VEMDGE RR V I NNKT A I E S QLG VAVK I KS HLTL Q AT FNR QTGKHHQAKQGAL 676 

EG VN + + L ++LLA R + AK L 

Sbjct: 694 EFKGVTTVRTNGYGLRTDLS - GGRAELALGLAAALGRGHQLYTSYEYAKGNKL 745 

Query: 677 NLQWTF 682 

L WTF 

Sbjct: 746 TLPWTF 751 



tr Q7W8V8 Autotransporter [BPP2022] [Bordetella parapertussis] 53 8 AA 

align 

Score =47.0 bits (110), Expect = 0.001 

Identities = 52/220 (23%), Positives = 89/220 (39%), Gaps = 44/220 (20%) 

Query: 479 GNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDG TERFTSKGITA 533 

GN G YAT+ GYD ++ R-f- H + T ++ + GI 

Sbjct: 339 GNSD S LHVGAYAT Y I G DGGF YLDG I VRVNR YEHD FKADGQRGARVTGKYRANG I GL 3 94 

Query: 534 SIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGK- -FSDSENAHVNLLGSRQLQT 591 
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S+E G FT G+ + + + PQ + + L +GK ++ S V+ + ++ L 
Sbjct: 395 SLETG RRFTWAGDW FVEPQVEVA-LFRSGKADYTASNGVRVDVASTKSLLG 444 

Query: 592 RVGVQAKAQFSLYKNIAIEPFAAWALYHNKPFGVEMDGERRVINNKTA IE 642 

R+G+Q + L + + P+A ++ L E DG +V N E 

Sbjct: 445 RLGLQVGRKLDLGNGKLVQPYAKLSWLQ E FDGVGKVRTND I GHD VKLRGGRAE 497 

Query: 643 SQLGVAVKIKSHLTLQATFNRQTGKHHQAKQGALNLQWTF 682 

LGVA + .H +L A++ G L + W+F 

Sbjct: 498 LDLGVAAALGKHS S L F AS YE YS KGS R LTIPWSF 530 



tr Q7W1F5 Autotransporter [BPP0735] [Bordetella parapertussis] 984 AA 

align 

Score = 46.2 bits (108), Expect = 0.002 

Identities = 72/280 (25%), Positives = 110/280 (38%), Gaps = 39/280 (13%) 

Query: 4 07 RKGLWLRVIDGHSNQWVQGKTAP -VEGYRKGVQLGGEVFTWQNE SNQLS IGLMGGQA 462 

RK W RVI QGP+G QG++ + S ++L + GQA 

Sbjct: 711 RKAGWGRVIGQRLTQRWDGDVEPRFKGNIWIAQAGADMLERDRDDGLSDRLGLFSAYGQA 770 

Query: 463 EQRSTFHNPDTDNLTTGN- -VKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTE 520 

+ R G V+ +G G+Y W +L K T Y D+ + + R + + 

Sbjct: 771 DGRVDGFVQGEHGKQAGKLRVEAYGLGLY - - WTRL - - KHTNWYWDNVLMGNYYTGRSRSD 826 

Query: 521 DGTER-FTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENA 579 

G G TAS EAGY+ H + LQPQAQL Y +++ +N 

Sbjct: 827 RGVAASLEGWGFTASSEAGYS - FFPRH DIMLQPQAQLVY QYTSLDNT 872 

Query: 580 H VNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRV 634 

H + G L R+G+ + +1 P-fA +N L+H GE 
Sbjct: 873 HDAYSTIRYHGGGALTGRIGLLLQGNADQPER- - IRPYARIN-LWHR FSHGESVS 924 

Query: 63 5 INNKTAIESQLGVAVKIKSHLTLQATFNRQTGKHHQAKQG 674 

+1 ++ G + + + L A NRQT + A G 
Sbjct: 925 FGPSDSIRTEYG-STSMDLRIGLAAPLNRQTELYASAGYG 963 



tr Q7WP64 Autotransporter [BB0 821] [Bordetella bronchi septic a 987 
(Alcaligenes AA 
bronchisepticus) ] align 

Score = 45.4 bits (106), Expect = 0.003 

Identities = 71/275 (25%), Positives = 106/275 (37%), Gaps = 29/275 (10%) 

Query: 407 RKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQLS- - IGLMG- -GQA 462 

RK W RVI Q G P + G ++ + LS +GL G GQA 

Sbjct: 714 RKAGWGRVIGQRLTQRWDGDVEPRFKGNIWIAQAGADMLERDRDDGLSDRLGLFGAYGQA 773 

Query: 4 63 EQRSTFHNPDTDNLTTGN- -VKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTE 52 0 

+ R G V+ +G G+Y W +L K T Y D+ + + R ++ 

Sbjct: 774 DGRVDGFVQGEHGKQAGKLRVEAYGLGLY- -WTRL- -KHTNWYWDNVLMGNYYTGRSRSD 829 

Query: 521 DGTER-FTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENA 579 
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G G TAS EAGY+ H + LQPQAQL Y + + + 

Sbjct: 830 RGVAASLEGWGFTASSEAGYS - FFPRH DIMLQPQAQLVYQYTSLDDTHDAYS 880 

Query: 580 HVNLLG S RQLQTR VG VQAKAQ F S L Y KN I A I E P F AAVNAL YHNKP FG VEMDGE RR V I NNKT 63 9 

+ G L R+G+ + +1 P+A +N L+H GE 
Sbjct: 881 TIRYHGGGALTGRIGLLLQGNADQPER- - IRPYARIN-LWHR FSHGESVSFGPSD 932 

Query: 64 0 AI E S QLGVAVKI KSHLTLQATFNRQTGKHHQAKQG 674 

+1 ++ G + + + L A NRQT + A G 
Sbjct: 933 S IRTEYG - STSMDLRIGLAAPLNRQTELYASAGYG 966 



tr Q6G430 Hypothetical protein [BH05490] [Bartonella henselae 
(Rochalimaea 
henselae) ] 

Score = 44.3 bits (103), Expect = 0.008 
Identities = 55/225 (24%), Positives = 91/225 (40%), Gaps = 42/225 (18%) 

Query: 479 GNVKG FGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDGTE RFTSKG 530 

G V G + G YAT+ ++G Y DS ++Y ++++ + + G 

Sbjct: 1088 GGVSGINTYSIGAYATYFD KSGWYLDSILKYNQYQNNLKAVSTNGIAIEGNYNQWG 1143 

Query: 531 ITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNG KFSDSENAHVNLLGSR 587 

+ S EAGY TK +4-QP AQL++L V G K Sh~ +N S 

Sbjct: 1144 LGTS FEAGYRF ETTKSS WMQPYAQLSWLQVEGKEIKLSNEMTGDINAFTS - 1193 

Query: 588 QLQTRVGVQAKAQFS L YKN I AI E P F AAVNAL YHNKP FG VEMDGERRVINNKTAI E SQL - - 645 

L++ VG+ A +F L ++- + L N+ D R IN + L 

Sbjct: 1194 -LRSEVGLSAGYEFCLGGDVTSMAYITAAWLRENR DSNRTTINKLHQFVTDLSG 1246 

Query: 646 GVAVKI KSHLTLQATFNRQTG - KHHQAKQGALNLQWTF 682 

G++ + LA G K Q+ QG L ++++F 

Sbjct: 1247 NFGKLGIGLSSLVSKKFKLYAEAQYVKGDKVKQSFQGILGVRYSF 1291 



1291 
AA 

align 



tr Q7W7D4 Putative autotransporter [bapC] [Bordetella parapertussis] 986 AA 

align 

Score = 44.3 bits (103), Expect = 0.008 

Identities = 93/485 (19%), Positives = 172/485 (35%), Gaps = 80/485 (16%) 

Query: 234 GQTISLDEFRLENSLWEPRWDSNVGKLKTTNADIRFNTKSES LLVKEDYAGGAR 287 

G T +++ R+E+ W S V L + + T +E+ L VK +G 

Sbjct: 536 GITKTVNALRIEDGTWTVTGSSAVNSLHLQAGKVAYATPAETNGKFKHLRVKT-LSGSGL 594 

Query: 288 FRFAYDPKEAKNTALIFEKNVTGTSDIIFENP IDDLKSLD GHQIIKV 334 

F + L+ +G ++ ++ L +4- G + 

Sbjct: 595 FEMNASADLSDGDLLWSDEASGQHKVLVRGAGTEPTGVESLTLVELPKGSQAGFTLANR 654 

Query: 335 NGTADKHAFRLSGKHQKGIYTLSLQQRPEGFLPKCKN AMIWRFMHNRLKPPIP 3 87 

G D AFR G++ L+ N +IWNL + 

Sbjct: 655 GGWDAGAFRYRLTPDNGVWGLERTSQLSAVANAALNTGGVGAASSIWYAKGNALSKRLG 714 
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Query: 3 88 YCALRLNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQ- -LGGEVFT 44 5 

' LRL+ + RT +K + +D ' + + K V G+ G + G+ 
Sbjct: 715 - -ELRLDPGAGGFWGRTFAQK QQLDNKAGRRFDQK VYGFELGADHAIAGQQGR 765 

Query: 44 6 WQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYAD 5 05 

W +G + G R +F + + ++++AV GYD 
Sbjct: 766 WH VGGLLG YTRARRS F VDDGAGHTDS AH I GAYAAYV ADNGFYFD 809 

Query: 506 SWMQYQRFRH- -RINTEDGTE RFTS KG ITAS I E AGYNALLAEHFTKKGNS LRVYLQP 560 

S RF + + D ++ + G+ A++EAG L + + + + + P 

Sbjct: 810 STLRASRFENDFTVTATDAVSIRGKYRANGVGATLEAGKRFTLHDGW FVEP 860 

Query: 561 QAQLTYLGVNG - KFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALY 619 

+ + + +G + + N V G R+G+ + L K+ I+P+A ++ L 

Sbjct: 861 QSEVSLFHASGGTYRAAMNLSVKDEGGTSAVLRLGLATGRRIDLGKDRVIQPYATLSWLQ 920 

Query: 62 0 HNKPFGVEMDGERRVINNKTAIESQLGVAVKIKSHLTLQATFNRQTG KHHQAKQGAL 676 

EG V N + + LG + + L L A R + AK L 

Sbjct: 921 EFKGVTTVRTNGYGLRTDLG-GGRAELALGLAAALGRGHSLYTSYEYAKGSKL 972 

Query: 677 NLQWT 681 

L WT 

Sbjct: 973 TLPWT 977 



tr Q9JMS5 YcbB protein [ycbB] [Escherichia coli] 1758 AA 

align 

Score = 43.5 bits (101), Expect = 0.013 

Identities = 29/112 (25%) , Positives = 54/112 (47%) , Gaps = 12/112 (10%) 

Query: 437 VQLGGEVFTWQNESNQLS IGLMG - - GQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQ 4 94 

+ +GG+++ W + N IG+MG G A ++T N + TGNV G+ G+Y + Q 
Sbjct: 14 99 MM I GGD I YNWTD G YN Y S H I G I MGGMG S AANKT T S TN NKRATGNVDGYTLGLYHVFQQ 1555 

Query: 495 LQDKQTGAYADSWMQYQRFRHRI-NTEDGTERFTSKGITASIEAGY 539 

+ ++ G + S +QY ++++T+ + G + E GY 

Sbjct: 1556 NI S DGLNE S ERQGL WT YS S I Q YMD YDNS VS S TNNFKAN YGVNGFRLTGE VG Y 1607 



tr Q9XD84 TibA [tibA] [Escherichia coli] 989 AA 

align 

Score =43.1 bits (100), Expect = 0.017 

Identities = 50/210 (23%) , Positives = 86/210 (40%) , Gaps = 26/210 (12%) 

Query: 3 93 LNNKNSDIFDRTLPRKGLWLRVIDGHSNQWVQGKTAPVEGYRKGVQLGGEVFTWQNESNQ 452 

L ++ D+ T G+W R G N+ G ++ + G + G ++-S + 

Sbjct: 711 LRFRHGD VMQNTRAPGG VWGRYT - GSDNR I SGGAS S G YTLTQNGFETGADMVFDL S DS S - 768 

Query: 453 LSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQR 512 

L++G ++ S H + NV G G+YATW GYD ++Y R 

Sbjct: 769 LAVGTFFS YSDN -SI KHARGGKS - NVDSSGGGL YATWFD NDGYYVDGVLKYNR 819 

Query: 513 FRHRINT- -EDGTE- - -RFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYL 567 
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F + + T DGT + + G S+EAG L E+ + QP + T 

Sbjct: 820 FNNELRTWMSDGTAVKGDYSQNGFGGSLEAGRTFSLNEN AWAQ P YVRTTAF 870 

Query: 568 GVNGKFSDSENAHVNLLG - SRQLQTRVGVQ 596 

+ K N +G ++ LQ G+ + 

Sbjct: 871 RADKKE IRLNNGMKAS IGATKSLQAEAGLK 900 



tr Q6G428 Hypothetical protein [BH0 5510] [Bartonella henselae 874 
(Rochalimaea AA 
henselae) ] align 

Score =42.4 bits (98), Expect = 0.029 

Identities = 54/225 (24%), Positives = 90/225 (40%), Gaps = 42/225 (18%) 

Query: 47 9 GNVKG FGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDGTE RFTSKG 53 0 

G V G + G YAT+ ++G Y DS ++Y ++++ + + G 

Sbjct: 671 GGVSGINTYS IGAYATYFD KS GW YLDS I LKYNQ YQNNLKAVS TNG I AI EGNYNQ WG 726 

Query: 531 ITASIEAGYNALLAEHFTKKGMSLRVYLQPQAQLTYLGVNG KFSDSENAHVNLLGSR 587 

+ S EAGY TK ++QP AQL-f+L V G K S+ +N S 

Sbjct: 727 LGTSFEAGYRF ETTKSS WMQP YAQLS WLQVEGKE I KLSNEMTGD INAFTS - 776 

Query: 588 QLQTRVGVQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGERRVINNKTAIESQL- - 645 

L-f+ VG+ A + F L ++ + h N+ D IN + L 

Sbjct: 777 -LRSEVGLSAGYEFCLGGDVTSMAYITAAWLRENR D S NHTT I NKLHQ F VTDL S G 829 

Query: 646 GVAVKIKSHLTLQATFNRQTG-KHHQAKQGALNLQWTF 682 

G++ + Li A G K Q+ QG L ++ + + F 

Sbjct: 83 0 NFGKLG IGLS S LVS KKFKLYAEAQYVKGDKVKQS FQG I LGVRYS F 874 



tr Q83JR2 Exported serine protease SigA [sigA] [Shigella 12 85 AA 

flexneri] align 

Score = 42.4 bits (98), Expect = 0.029 

Identities = 71/307 (23%), Positives = 124/307 (40%), Gaps = 48/307 (15%) 

Que ry : 3 93 LNNKNSD I FDRTL PRKGLWLRVIDGH SNQWVQGKTAPVEGYRKGVQLGG - EVFTWQN 44 8 

LN + D-f D T G W R++.G S + T G K +L G ++FT 
Sbjct: 1010 LNKRMGDLRD-TNGEAGAWARIMSGAGSASGGYSDNYTHVQIGVDKKHELDGLDLFT 1065 

Query: 44 9 ESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWM 5 08 

GL + ++ N +G K GAG+YA+ +GAY D 

Sbjct: 1066 GLTMTYTDSHAS SNAFSGKTKSVGAGLYAS AIFDSGAYIDLIS 1108 

Query: 509 QYQRFRHRINTED GTERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLT 565 

+Y + + GT+ ++S + EAGY H T+ +++PQA+L 
Sbjct: 1109 KYVHHDNEYSATFAGLGTKDYSSHSLYVGAEAGYRY HVTEDS WIEPQAELV 1159 

Query: 566 YLGVNGKFSDSENAHVNLL GSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALY - - 619 

Y V+GK D ++ +++ L R GV FS K+ + A + + 

Sbjct: 1160 YGAVSGKRFDWQDRGMSVTMKDKDFNPLIGRTGVDVGKSFS-GKDWKVTARAGLGYQFDL 1218 

Query : 62 0 - -HNKPFGVEMDGERRVINNKTA- IESQLGVAVKIKSHLTLQATFNRQT-GKHHQAKQGA 675 
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+ + + GE+R+ K I +G+ +1+ +L F + GK++ 

Sbjct: 1219 FANGETVLRDASGEKRI KGEKDGRI LMNVGLNAE IRDNLRFGLE FEKS AFGKYNVDNAIN 1278 

Query: 676 LNLQWTF 682 

N + + + F 
Sbjct: 1279 ANFRYSF 1285 



tr Q9L8L1 Exported serine protease SigA [sigA] [Shigella flexneri 2a] 12 85 AA 

align 

Score = 42.4 bits (98), Expect = 0.029 

Identities - 71/307 (23%) , Positives = 124/307 (40%) , Gaps = 48/307 (15%) 

Query : 3 93 LNNKNSDIFDRTLPRKGLWLRVIDGH- - - SNQWVQGKTAPVEG YRKGVQLGG - EVFTWQN 4 4 8 

LN + D+ D T G W R++ G S + T GK+LG ++FT 
Sbjct: 1010 LNKRMGDLRD - TNGEAGAWARIMSGAGSASGGYSDNYTHVQIGVDKKHELDGLDLFT 1065 

Query: 449 ESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWM 508 

GL + ++ N +G K GAG+YA4- +GAY D 

Sbjct: 1066 GLTMTYTDSHAS SNAFSGKTKSVGAGLYAS AIFDSGAYIDLIS 1108 

Que ry : 5 0 9 Q YQRFRHR INTED GTERFTS KG I TAS I EAG YNALLAEHFTKKGNS LRVYLQPQAQLT 565 

+Y + + GT+ + + S + EAGY H T+ +++PQA+L 
Sbjct: 1109 KYVHHDNEYSATFAGLGTKDYSSHSLYVGAEAGYRY HVTEDS WIEPQAELV 1159 

Query: 566 YLGVNGKFSDSENAHVNLL GSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALY- - 619 

Y V+GK D +++ L R GV FS K+ + A + + 

Sbjct: 1160 YGAVSGKRFDWQDRGMSVTMKDKDFNPLIGRTGVDVGKSFS-GKDWKVTARAGLGYQFDL 1218 

Query: 62 0 - -HNKPFGVEMDGERRVINNKTA-IESQLGVAVKIKSHLTLQATFNRQT-GKHHQAKQGA 675 

+ + + GE+R+ K I +G+ +1+ +L F + GK+ + 

Sbjct: 1219 FANGETVLRDASGEKRIKGEKDGRILMNVGLNAEIRDNLRFGLEFEKSAFGKYNVDNAIN 1278 

Query: 676 LNLQWTF 682 

N + + + F 

Sbjct: 1279 ANFRYSF 1285 



tr Q7C013 Serine protease [sigA] [Shigella flexneri] 1285 AA 

align 

Score =42.4 bits (98), Expect = 0.029 

Identities = 71/307 (23%), Positives = 124/307 (40%), Gaps = 48/307 (15%) 

Query : 3 93 LNNKNSDIFDRTLPRKGLWLRVIDGH SNQWVQGKTAPVEG YRKGVQLGG - EVFTWQN 44 8 

LN + D+ D T G W R++ G S + T GK-fLG + + FT 
Sbjct: 1010 LNKRMGD LRD - TNGE AG AWAR IMS GAGS AS GG Y S DN YTHVQ I G VD KKHE LDGLDL FT 1065 

Query: 44 9 ESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWM 5 08 

GL + ++ N +G K GAG+YA+ +GAY D 

Sbjct: 1066 GLTMTYTDSHAS SNAFSGKTKSVGAGLYAS AIFDSGAYIDLIS 1108 

Query : 5 09 Q YQRFRHR INTED GTERFTSKGITAS I EAG YNALLAEHFTKKGNS LRVYLQPQAQLT 565 
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+Y + + GT+-++S + EAGY H T+ ++ + PQA+L 

Sbjct: 1109 KYVHHDNEYSATFAGLGTKDYSSHSLYVGAEAGYRY HVTEDS WIEPQAELV 1159 

Query: 566 YLGVNGKFSDSENAHVNLL GSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALY- - 619 

Y V+GK D ++ +++ L R GV FS K+ + A + + 

Sbjct: 1160 YGAVSGKJIFDWQDRGMSVTMKDKDFNPLIGRTGVDVGKSFS-GKDWKVTARAGLGYQFDL 1218 

Query: 620 - -HNKPFGVEMDGERRVINNKTA- IESQLGVAVKIKSHLTLQATFNRQT-GKHHQAKQGA 675 

+ •+• + GE+R+ K I" +G + + +L F + GK+ + 

Sbjct: 1219 FANGETVLRDASGEKRIKGEKDGRILMNVGLNAEIRDNLRFGLEFEKSAFGKYNVDNAIN 1278 

Query: 676 LNLQWTF 682 

N + + + F 
Sbjct: 12 7 9 ANFRYSF 12 85 



tr Q8UJW1 Autotransporter protein [yapE] [Agrobacterium tumefaciens 868 
(strain AA 
C58 / ATCC 33970)] align 

Score =40.0 bits (92), Expect = 0.14 

Identities = 66/293 (22%), Positives = 113/293 (38%), Gaps = 46/293 (15%) 

Query: 411 WLRVIDGHSNQWVQGKTAP - VEGYRKGVQLGGEVFTWQNESNQLS - IGLMGGQAEQRSTF 468 

W R + + G +P +G G+Q G ++ + E+ GL G + 

Sbjct: 5 99 WARTFGQNTEMKWDGTVSPSFDGNLFGLQAGQDLLGRETEAGGFDRFGLFVGYSRMNGDI 658 

Query: 469 HNPDT- -DNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDGTE-R 525 

-fNL GV G AW + + G Y D+ + F + G 

Sbjct: 659 KGQALGWNNLAVGEVDIGGTSFGAYWTHVGAQ- -GWYLDAVLMGTWFSGDATSRAGESVN 716 

Query: 526 FTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVN GKFS DS 576 

+ AS+E GY L E +T L.+PQAQ+ + ++ +FS DS 

Sbjct: 717 IDGSSVAASLEGGYPIALTEDWT LEPQAQI IWQKLSLDDEADRFSSVAFDS 767 

Query: 577 ENAHVNLLGSR QLQTRVG-VQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGER 632 

+NA LG R QT G +Q + + + + + + + N P EG 

Sbjct: 768 DNAVTGRLG VRLQGN YQTD S GL I Q P YL KAN I WHG F S S D QMTR FDNDPIVTETGG- - 821 

Query: 633 RVINNKTAIESQLGVAVKIKSHLTLQATFNRQT GKHHQAKQG- -ALNLQW 680 

T++E G+ + + + + AT + T G+ +A +G LN++W 
Sbjct: 822 TS LE IGGGLVASLTEKVS VFATVD YTTNLGGERKRAI EGNI GLNI KW 868 



tr Q7D396 AGR_pAT_528p [AGRjpAT_528] [Agrobacterium tumefaciens (strain 921 
C58 / AA 
ATCC 33970)] align 

Score = 40.0 bits (92), Expect = 0.14 

Identities = 66/293 (22%), Positives = 113/293 (38%), Gaps = 46/293 (15%) 

Query : 411 WLRVIDGHSNQWVQGKTAP -VEGYRKGVQLGGEVFTWQNESNQLS - IGLMGGQAEQRSTF 468 

W R ++ G +P +G G+Q G ++ + E+ GL G + 

Sbjct: 652 WARTFGQNTEMKWDGTVSPSFDGNLFGLQAGQDLLGRETEAGGFDRFGLFVGYSRMNGDI 711 
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Query : 4 6 9 HNPDT - - DNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDGTE -R 52 5 

H-NL GV.G AW + + GYD+4- F + G 

Sbjct: 712 KGQALGWNNLAVGEVDIGGTSFGAYWTHVGAQ--GWYLDAVLMGTWFSGDATSRAGESVN 769 

Query: 526 FTS KG ITAS I EAGYNALLAEHFTKKGNS LRVYLQPQAQLTYLGVN GKFS DS 576 

+ AS+E GY L E +T L+PQAQ+ + + + +FS DS 

Sbjct: 770 IDGSSVAASLEGGYPIALTEDWT LEPQAQIIWQKLSLDDEADRFSSVAFDS 820 

Query: 577 ENAHVNLLGSR QLQTRVG-VQAKAQFSLYKNIAIEPFAAVNALYHNKPFGVEMDGER 632 

+NA LG R QT G +Q + + + + + + + N P EG 

Sbjct: 821 DNAVTGRLGVRLQGNYQTDSGLIQPYLKANIWHGFSSDQMTR FDNDPIVTETGG- - 874 

Query: 633 RVINNKTAIESQLGVAVKIKSHLTLQATFNRQT GKHHQAKQG- -ALNLQW 680 

T++E G+ + ++ + AT + T G+ +A +G LN++W 
Sbjct: 875 TS LE I GGGLVAS LTEKVS VFATVD YTTNLGGERKRAI EGNI GLN I KW 921 



tr Q7WK90 Autotransporter [phg] [Bordetella bronchi septica (Alcaligenes 415 AA 
bronchisepticus) ] 

align 

Score = 39.7 bits (91), Expect = 0.19 

Identities = 43/213 (20%) , Positives = 85/213 (39%) , Gaps = 49/213 (23%) 

Query: 446 WQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYAD 505 

W+ + + +L +G G + R D +++ G ++ G Y T+ G Y D 

Sbjct: 204 WRLDTGRLYLGAYAGVSRARM DDNDIMHGRIESRFLGTYLTYVD NGGFYVD 254 

Query: 506 SWMQYQRFRHRINTE DGTERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQ 55 9 

+ -f R +++ + + T S+EAGY+ L + * R +++ 

Sbjct: 255 AVSKLGRIDESVSFDLPLGLGDYDDDISHTTYTGSVEAGYHFKLPQ RWFVE 305 

Query: 560 PQAQLTYLGVNGKFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALY 619 

PQAQ+ Y S ++ G R GV+A F+L + P+ + + L+ 

Sbjct: 306 PQAQVIY SRSSQTSVQG RAGVRAGRDFTLAGGATLRPYVSASYLH 350 

Query: 620 H NKPFGVEMDGERRVINNKTAIE 642 

K + E+ G R + A++ 
Sbjct: 351 EFSHDDSVDFGGKSYDAELPGSRWQLGAGAALD 383 



tr Q7W8X9 Autotransporter [phg] [Bordetella parapertussis] 415 AA 

align 

Score = 39.7 bits (91), Expect = 0.19 

Identities = 43/213 (20%) , Positives = 85/213 (39%) , Gaps = 49/213 (23%) 

Query: 44 6 WQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYAD 5 05 

W+ ++ +L +G G + R D +++ G ++ G Y T+ G Y D 

Sbjct: 204 WRLDTGRLYLGAYAGVSRARM DDNDIMHGRIESRFLGTYLTYVD NGGFYVD 254 

Query: 506 SWMQYQRFRHRINTE DGTERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQ 559 

++R +++ D++ T S+EAGY+ L + R +++ 

Sbjct: 255 AVSKLGRIDESVSFDLPLELGDYDDDISHTTYTGSVEAGYHFRLPQ RWFVE 305 
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Query: 560 PQAQLTYLGVNGKFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALY 619 

PQAQ-f Y S ++ ++ G R GV+A F+L + P+ + 4- L+ 

Sbjct: 306 PQAQVIY SRSSQTSVQG RAGVRAGRDFTLAGGATLRPYVSASYLH 350 

Query: 62 0 H NKPFGVEMDGERRVINNKTAIE 642 

K + E + G R + A++ 
Sbjct: 351 EFSHDDSVDFGGKSYDAELPGSRWQLGAGAALD 3 83 



tr Q84GK0 Secreted autotransporter protein EatA [eatA] [Escherichia 13 64 

coli] AA 

align 

Score = 39.7 bits (91), Expect = 0.19 

Identities = 77/350 (22%), Positives = 136/350 (38%), Gaps = 53/350 (15%) 

Query: 244 LENSLWEPRWDSNVGKLKTTNADIRFNTKSE - - SLLVKEDYAGGARFRFAYDPKEAKNTA 301 

+ N+LW +S + +LK ++ I + L VKE A + F + +A 

Sbjct : 92 8 MNNALWHSDRNSELKELKANDSQIELGVRGHFAKLRVKELIASNSVFLVHANNSQADQ-- 985 

Query: 302 LIFEKNVTGTSDIIFENPIDDLKSLDGHQIIKVNGTADKHAFRLSGKHQKGIYTLSLQQR 361 

L + G+ + + I + + + +1 +D++ F+ +G Q G ++ + R 

Sbjct : 986 LNVTDKLQGSNNTILVDFFNKAANGTNVTLITAPKGSDENTFK-AGTQQIGFSNITPEIR 1044 

Query: 362 PEGF LPKCKNAM I WRFMHNRLKPP I P YCALRLNNKNSD I FDRTLPRKGLW 411 

E +A + + + +NN N + D 
Sbjct: 1045 TENTDTATQWVLTGYQSVADARASKIATDFMDSGYKSFLTEVNNLNKRMGD 1095 

Query: 412 LRVIDGHSNQWVQ GKTAPVEGYRKG VQLGGEVFTWQNESNQLS IGLMGGQAEQR 465 

LR G + W + G + GYR VQ+G + ++E N I L G 

Sbjct: 1096 LRD S QGD AGGW AR I MNGTG S GE S G YRDN YTH VQ I GADR KHELN- -GIDLFTGALLTY 1150 

Query : 466 STFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRF- -RHRINTEDGT 523 

+ N + +G K G GVYA+ +-f GAY D +Y R+ +N 

Sbjct: 1151 TD- -NNASSQAFSGKTKSLGGGVYASGLF E S GAYFDL I GKYLHHDNR YTLNF AS LG 1204 

Query: 524 ER- FTSKGITAS IEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGK 572 

ER +TS + A E GY ++E+ +++PQ +L Y V+GK 

Sbjct: 1205 ERS YTSHSLYAGAE IGYRYHMSEN TWVEPQMELVYGSVSGK 1245 



tr Q8UJX1 Autotransporter protein [bapA] [Agrobacterium tumefaciens 
(strain 

C58 / ATCC 33970)] 



Score = 39.3 bits (90), Expect = 0.25 
Identities = 49/199 (24%), Positives = 83/199 (41%), Gaps = 32/199 (16%) 

Query: 40 9 GLWLRVIDGHSNQWVQGKTAPVEGYRKGV QLGGEVFTWQNESNQL SIGLMGGQ 4 61 

G+W R+ H++ + A E Y + V Q G + + ES +L ++ G+ 

Sbjct: 759 GWGRIEGVHNHIEPRFSAAAAE-YDQNVFKLQAGIDGLLTETESGKLIGGFTVHYAHGK 817 

Query: 4 62 AEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTED 521 

+ RS + + + + G+G G TW+ + G Y D Q + +N+ 

Sbjct: 818 TDTRS VWGDGE I S TDG YGLGGTLTWYG ENGFYLDGQAQMTWYTSGLNSLL 867 



1035 
AA 

align . 
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Query: 522 GTERFTSK GITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVN-GKFSDS 576 

T G T S+E+G + + + + PQAQL Y V+ F+D+ 

Sbjct: 868 ARTNLTDNNDGFGYTLSLESGTRIAIDPGWS VT PQAQL VYSNVDFDAFTDT 918 

Query: 577 ENAHVNLLGSRQLQTRVGV 595 

A V+L LQ R+GV 

Sbjct: 919 FGAR VS LDRGE S LQGRLGV 937 



tr Q7D3A2 AGR pATSllp [AGRjpAT _511] [Agrobacterium tumefaciens (strain 1192 
C58 / AA 
ATCC 33970)] alig n 

Score = 39.3 bits (90), Expect = 0.25 

Identities = 49/199 (24%) , Positives = 83/199 (41%) , Gaps = 32/199 (16%) 

Query: 4 09 GLWLRVIDGHSNQWVQGKTAPVEGYRKGV QLGGEVFTWQNESNQL SIGLMGGQ 4 61 

G+W R+ H++ + A E Y + V Q G + + ES +L + + G+ 

Sbjct : 916 GVWGRIEGVHNHIEPRFSAAAAE-YDQNVFKLQAGIDGLLTETESGKLIGGFTVHYAHGK 974 

Query: 4 62 AEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTED 521 

+ RS + + + + G+G G TW+ + G Y D ' Q + +N+ 

Sbjct : 975 TDTRSVWGDGEI STDGYGLGGTLTWYG ENGFYLDGQAQMTWYTSGLNSLL 1024 

Query: 522 GTERFTSK GITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVN-GKFSDS 576 

T G T S+E+G + ++ + PQAQL Y V+ F+D+ 

Sbjct: 1025 ARTNLTDNNDGFGYTLSLESGTRIAIDPGWS VTPQAQLVYSNVDFDAFTDT 1075 

Query: 577 ENAHVNLLGSRQLQTRVGV 595 

A V+L LQ R+GV 

Sbjct: 1076 FGARVS LDRGE S LQGRLGV 1094 



tr Q6MTD5 Hypothetical transmembrane protein [MSC0473] [Mycoplasma 
mycoides 

(subsp. mycoides SC) ] 

Score =38.9 bits (89), Expect =0.32 
Identities = 45/181 (24%), Positives = 74/181 (40%), Gaps = 24/181 (13%) 

Query: 27 FFSILYTSPLLAVDYVYDKTKLTNDEITRLKKLRDKTSEYWKKETYLITEDNPKVPPFPA 86 

F ++ + L ++ + KTKLT ++ +LKK +DK Y K+ +IT K 
Sbjct: 182 FNDLMLS FYLSRI ENLNTKTKLTAKDLAKLKKFKDKYDF YLNKQEQ I ITNKKLK Q 236 

Query : 87 LYPRTYQFENINNSKKISFYDQEYTEGYLVGFARGLGVAKRNGDTEEQIRKYFKECFNSN 14 6 

L Y E +N K++ + +Y E YL + +EQI K 

Sbjct: 23 7 LKKDQYNLELVN - - KRLLKLENKY -EI YL NKQEQINNNKKAKLLKK 279 

Query: 147 TKIRDY-STCQAEKFGSHPLIVKSHIFSLGPKIKNSHINSEILSVGNYTEWANQVIHHIE 205 

K + + C + 1+ + F+ IKNS I EL ++QV+ +E 

Sbjct: 280 KKSEEIKNKCLLKLENKKQKIINKNSFTKS-SIKNSLIKQEFLQARINKNTSSQVLETLE 338 

Query: 206 N 206 
N 



873 
AA 

align 
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Sbjct: 339 N 339 



sp P45386 Immunoglobulin Al protease precursor (EC 3.4.21.72) (IGA1 1849 

IGA4_HAEIN protease) AA 

[iga] [Haemophilus influenzae] align 

Score = 38.5 bits (88), Expect = 0.42 

Identities = 34/142 (23%), Positives = 59/142 (40%), Gaps = 20/142 (14%) 

Query: 235 QT I SLDEFRL - ENSLWEPRWDSNVGKLKTTNAD IRFNTKSE SLLVKEDYAGGA 286 

Q+I + L ENS W +SNV +L TO I N + + + + L +G 

Sbjct: 852 QSIGTSQWLKENSHWHLTGNSMVNQLNLTNGHIHLNAQNDANKVTTYNTLTVNSLSGNG 911 

Query: 287 RFRFAYDPKEAKNTALIFEKNVTG TSD 1 1 FENP IDDLKSLDGH QIIKV 334 

F + D K+ K+ TG +D E ++L D + + 

Sbjct: 912 SFYYWVT)FTISnsrKSNKVvW^ 971 

Query: 33 5 NGTADKHAFRLSGKHQKGIYTL 356 

NG+ D+- A++ ++ G Y L 
Sbjct: 972 NGSVDRGAWKYKLRNVNGRYDL 993 



tr Q9Z5R3 Phg protein (Autotransporter) [phg] [Bordetella pertussis] 418 AA 

align 

Score = 38.1 bits (87), Expect = 0.55 

Identities = 43/213 (20%), Positives = 84/213 (39%), Gaps = 49/213 (23%) 

Query: 446 WQNESNQLSIGLMGGQAEQRSTFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYAD 505 

W+ ++ +L +G G + R D +++ G ++ G Y T+ G Y D 

Sbjct: 207 WRLDTGRLYLGAYAGVSRARM DDMD I MHGR I E SRFLGT YLT YVD NGGFYVD 257 

Query: 506 SWMQYQRFRHRINTE DGTERFTS KG I TAS I E AG YNALL AEHFTKKGNS LRVYLQ 559 

+ + R + +. + D + + TS EAGY+ L 4- R + + + 

Sbjct: 258 AVSKLGRIDESVSFDLPLGLGDYDDDISHTTYTGSAEAGYHFKLPQ RWFVE 308 

Query: 560 PQAQLTYLGVNGKFSDSENAHVNLLGSRQLQTRVGVQAKAQFSLYKNIAIEPFAAVNALY 619 

PQAQ+ Y S ++ ++ G R GV+A F+L + P+ + + L+ 

Sbjct: 309 PQAQVIY SRSSQTSVQG RAGVRAGRDFTLAGGATLRPYVSASYLH 353 

Query: 62 0 H NKPFGVEMDGERRVINNKTAI E 642 

K + E+ G R + A++ 
Sbjct: 354 EFSHDDSVDFGGKSYDAELPGSRWQLGAGAALD 3 86 



tr Q6KD18 Antigen 43 precusor Sap [sap] [Escherichia coli] 1042 AA 

align 

Score = 37.7 bits (86), Expect = 0.71 

Identities = 38/156 (24%), Positives = 65/156 (41%), Gaps = 14/156 (8%) 
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Query: 417 GHSNQW- -VQGKTAPVEGYRKGVQLGGEVFTWQNESNQLSIGLMGGQAEQRSTFHNPDTD 474 

GH N V+G T G V+L G++ + L+ G+ G A S+ D D 

Sbjct: 757 GHDNNGGIWGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYG- -AAGHSSVDVKDDD 814 

Query: 475 NLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTEDGTERFTSKGITAS 534 

G V+ AG + L +G 4-AD Q R + + + + + R +G S 
Sbjct: 815 GSRAGTVRD - DAGSLGGYLNLVHTS SGLWAD I VAQGTRHSMKAS SDNNDFRARGRGWQGS 873 

Query: 53 5 IEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVN 570 

+E G + ++ + L+PQ Q T+ G+ + 

Sbjct: 874 LETGLPFSITDN LMLE PQLQ YTWQGL S 900 



tr Q8FDW4 Aecreted auto transpoter toxin [sat] [Escherichia coli 06] 1299 AA 

align 

Score = 37.0 bits (84), Expect = 1.2 

Identities = 66/290 (22%), Positives = 114/290 (38%), Gaps = 40/290 (13%) 

Query: 412 LRVIDGHSNQW VQGKTAPVEGYRKG VQLGGEVFTWQNESNQLSIGLMGGQAEQR 465 

LR I+G S W + G + G-f VQ+G + N+ + L G 

Sbjct: 1031 LRD INGESGAWAR I I SGTGS AGGGFSDNYTH VQVGAD NKHELDGLDLFTGVTMTY 1085 

Query: 466 STFHNPDTDNLTTGNVKGFGAGVYATWHQLQDKQTGAYADSWMQYQRFRHRINTED G 522 

+ H + +G K GAG+YA+ + +GAY D +Y + G 

Sbjct: 1086 TDSHAGS - -DAFSGETKSVGAGLYAS AMFESGAYIDLIGKYVHHDNEYTATFAGLG 1139 

Query: 523 TERFTSKGITASIEAGYNALLAEHFTKKGNSLRVYLQPQAQLTYLGVNGKFSDSENAHVN 582 

T + + S A E GY H T +4- + PQA+L Y V+GK ++ +N 

Sbjct: 1140 TRDYSSHSWYAGAEVGYRY HVTDSA WIEPQAELVYGAVSGKQFSWKDQGMN 1190 

Query : 583 LL GS RQLQTRVGVQAKAQFS L YKN I AI E P F AAVNAL Y HNKPFGVEMDGERRV 634 

L LRGV FSK++A4- + + GE+R-f 

Sbjct: 1191 LTMKDKDFNPLIGRTGVDVGKSFS -GKDWKVTARAGLGYQFDLFANGETVLRDASGEKRI 1249 

Query: 635 INNKTA - 1 ES QLGVAVKI KSHLTLQATFNRQT - GKHHQAKQGALNLQWTF 682 

K + +G+ +1+ +L F + GK++ N +-H+F 

Sbjct: 1250 KGEKDGRMLMNVGLNAEIRDNLRFGLEFEKSAFGKYNVDNAINANFRYSF 1299 



Database : EXPASY/UniProt 

Posted date: Sep 12, 2004 1:43 PM 
Number of letters in database: 501,960,298 
Number of sequences in database: 1,568,464 

Lambda K H 

0.317 0.135 0.401 



Gapped 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 
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length of query: 682 

length of database: 501,960,298 

effective HSP length: 131 

effective length of query: 551 

effective length of database: 296,491,514 

effective search space: 163366824214 

effective search space used: 163366824214 

T: 11 



A: 40 
XI: 16 



X2 : 
X3 : 



38 
64 



SI: 41 
S2 : 77 



( 7.3 bits) 
(14.6 bits) 
(24.7 bits) 
(21.7 bits) 
(34.3 bits) 
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NiceProt 
View of 
Swiss- 
Prot: 
P19196 



Printer-friendly view Submit update j Quick BlastP searcl 



[Entry info] [Name and origin] [References] [Comments] [Cross-references] [Keywords] 

[ Features ] [Sequence] [ Tools ] 

Note: most headings are clickable, even if they don 't appear as links. They link to the user manual or other documents . 
Entry information 



Entry name 

Primary accession number 
Secondary accession numbers 
Entered in Swiss-Prot in 
Sequence was last modified in 
Annotations were last modified in 
Name and origin of the protein 
Protein name 
Synonyms 
Gene name 
From 

Taxonomy 



INVAJVEREN 
P19196 

None 

Release 16, November 1990 
Release 16, November 1990 
Release 40, October 2001 



Invasin 

None 
None 

Yersinia enterocolitica [TaxID: 630 ] 

Bacteria ; Proteobacteria ; Gammaproteobacteria ; Enterobacteriales ; 
Enterobacteriaceae ; Yersinia . 

References 

[1] SEQUENCE FROM NUCLEIC ACID. 
STRAIN=8081C / Serotype 0:8; 

MEDLINE-91041720;PubMed=2233250 [ NCBL ExPASy, EM, Israel Japan ] 
Young V.B. . Miller V.L. , Falkow S. . Schoolnik G.K. : 

"Sequence, localization and function of the invasin protein of Yersinia enterocolitica. 
Mol. Microbiol. 4:1119-1128(1990). 

Comments 

• FUNCTION: Invasin is a protein that allows enteric bacteria to penetrate cultured mammalian 
cells. The entry of invasin in the cell is mediated by binding several beta-1 chain integrins. 

• SUB CELL ULAR LOCA TION: Outer membrane. 

• SIMILARITY: Belongs to the intimin/invasin family. 
Copyright 

This Swiss-Prot entry is copyright. It is produced through a collaboration between the Swiss Institute of Bioinformatics and 
the EMBL outstation - the European Bioinformatics Institute. There are no restrictions on its use by non-profit institutions as 
long as its content is in no way modified and this statement is not removed. Usage by and for commercial entities requires a 
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EMBL 

PIR 

HSSP 

InterPro 
Pfam 



license agreement (See http://www.isb-sib.ch/announce/ or send an email to license@isb-sib.ch ) 
Cross-references 

X53368; CAA37448.1; -.[ EMBL / GenBank / DDBJ ] [ CoDineSequence l 
SI 1442 ; SI 1442. 

PI 1922 ; 1CWV. [ HSSP ENTRY / PDB ] 
IPR003344 ; Bigl. 
IPR003535 ; Intimin. 
IPR008964 ; Invasin_intimin. 
Gra phical view of domain structure . 
PF02369 ; Bigl; 1. 

Pfam graphical view of domain structure . 
PR01369 ; INTIMIN. 
SM00634 ; BID 1; 1. 

[ Domain structure / List of seq. sharing at least 1 domain ] 
[ Famil y / Alignment / Tree ] 
P19196 . 
P19196 . 
P19196 . 
P19196 . 
P19196 . 
P19196 . 

P19196 ; 6133FF9FDB8D9BBA. 
Get re g ion on 2D PAGE . 

View cluster of proteins with at least 50% / 90% identity. 



PRINTS 
SMART 
ProDom 
HOBACGEN 
BLOCKS 
ProtoNet 
ProtoMap 
PRESAGE 
DIP 

ModBase 
SMR 

SWISS-2DPAGE 
UniRef 
Keywords 
Outer membrane . 

Features 
None 

Sequence information 

Length: 835 Molecular weight: 91361 CRC64: 6133FF9FDB8D9BBA [This is a checksum on the 

sequence] 
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RVALFGKDNL QSNPYAVTTG LIYTPIPFIT LGVDQRMGKS RQHEIQWNLQ MDYRLGESFR 
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NiceProt 
View of 
Swiss- 
Prot: 
P11922 



Printer-friendly view 



Submit update 



Juick BlastP search 



I 



[Entry into] [Name and [References] [Comments] [Cross-references] [Keywords] 

[ Features ] [ Sequence ] [ Tools ] 

Note: most headings are clickable, even if they don't appear as links. They link to the user manual or other documents . 
Entry information 



INVAYERPS 
P11922 

None 

Release 12, October 1989 
Release 12, October 1989 



Entry name 

Primary accession number 
Secondary accession numbers 
Entered in Swiss-Prot in 
Sequence was last modified in 
Annotations were last modified in Release 45, October 2004 
Name and origin of the protein 
Protein name Invasin 
Synonyms None 
Gene name None 

From Yersinia pseudotuberculosis [TaxID: 633 ] 

Taxonomy Bacteria ; Proteobacteria : Gammaproteobacteria ; Enterobacteriales ; 

Enterobacteriaceae ; Yersinia . 

References 

[1] SEQUENCE FROM NUCLEIC ACID. 

DOI- 10.1016/0092-8674(87)90335-7 ;MEDLINE=87301720:PubMed=3304658 [ NCBL ExPASy, 
EBI, Israel Japan ] 

Isberg R.R. , Voorhis D.L. . Falkow S. ; 

"Identification of invasin: a protein that allows enteric bacteria to penetrate cultured mammalian 
cells."; 

Cell 50:769-778(1987) . 
[2] INTEGRIN-BINDING DOMAIN. 

MEDLINE=90269235;PubMed=1693333 [ NCBL ExPASy , EM, Israel , Japan ] 
Leong J.M. , Fournier R.S. . Isberg R.R. : 

"Identification of the integrin binding domain of the Yersinia pseudotuberculosis invasin protein."; 
EMBO J. 9:1979-1989q990l 

[3] X-RAY CRYSTALLOGRAPHY (2.3 ANGSTROMS) OF 494-985. 

DOI= l0.ll26/science.286.5438.29l :PubMed=l0514372 [ NCBL ExPASy , EBI, Israel Japan ] 
Hamburger Z. A., Brown M.S. , Isberg R.R. , Bjorkman P.J. ; 
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"Crystal structure of invasin: a bacterial integrin-binding protein."; 
Science 286:291-295(1999) . 

Comments 

• FUNCTION: Invasin is a protein that allows enteric bacteria to penetrate cultured mammalian 
cells. The entry of invasin in the cell is mediated by binding several beta-1 chain integrins. 

. SUBCELLULAR LOCATION : Outer surface. 

• SIMILARITY: Belongs to the intimin/invasin family. 

• CA UTION: It is uncertain whether Met- 1 , Met- 1 7 or Met- 1 9 is the initiator. 
Copyright 

This Swiss-Prot entry is copyright. It is produced through a collaboration between the Swiss Institute of Bioinformatics and 
the EMBL outstation - the European Bioinformatics Institute. There are no restrictions on its use by non-profit institutions as 
long as its content is in no way modified and this statement is not removed. Usage by and for commercial entities requires a 
license agreement (See http://www.isb-sib.ch/announce/ or send an email to license@i sb-sib.ch) 

Cross-references 

M17448; AAA27633.1; -. [ EMBL / GenBank / DDBJ ] [ CoDingSequence ] 

M17448; AAA27632.1; ALTINIT. [ EMBL / GenBank / DDBJ ] [ CoDingSequence ] 
Ml 7448; AAA27634.1; ALT INIT. [ EMBL / GenBank / DDBJ ] [ CoDingSequence ] 
M17448; AAA27635.1; ALTJNIT. [EMBL / GenBank / DDBJ ] [ CoDingSequence ] 
A29646; A29646. 
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1CWV; X-ray; A=494-985.[ ExPASy / RCSB / EBI ] 

IPR003344 :Big_l. 

IPR003535 : Intimin. 

IPR008964 : Invasin intimin. 

Graphical view of domain structure . 

PF02369 : Big_l;2. 

Pfam graphical view of domain structure . 
PR01369; INTIMIN. 
SM00634 : BID_1;2. 

[ Domain structure / List of seq. sharing at least 1 domain ] 
[ Family / Alignment / Tree ] 
PI 1922. 



PI 1922 



PI 1922 



PI 1922 



PI 1922 



PI 1922 



PI 1922 



... C875941B24BD35EE. 
SWISS-2DPAGE Get region on 2D PAGE . 

UniRef View cluster of proteins with at least 50% / 90% identity. 

Keywords 

3D-structure . 

Features 



Key 



Feature table viewer 



From To Length Description 



Feature aligner 
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QPYSVSSSAF ENLHPNNEME SSINPFSASD TERNAAIIDR ANKEQETEAV NKMISTGARL 

130 140 150 160 170 180 

I I I I I I 

AASGRASDVA HSMVGDAVNQ EIKQWLNRFG TAQVNLNFDK NFSLKESSLD WLAPWYDSAS 

190 200 210 220 230 240 

I I I I I I 

FLFFSQLGIR NKDSRNTLNL GVGIRTLENG WLYGLNTFYD NDLTGHNHRI GLGAEAWTDY 

250 260 270 280 290 300 

I I I I I I 

LQLAANGYFR LNGWHSSRDF SDYKERPATG GDLRANAYLP ALPQLGGKLM YEQYTGERVA 
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LFGKDNLQRN PYAVTAGINY TPVPLLTVGV DQRMGKSSKH ETQWNLQMNY RLGESFQSQL 
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SPSAVAGTRL LAESRYNLVD RNNNIVLEYQ KQQWKLTLS PATISGLPGQ VYQVNAQVQG 

430 440 450 460 470 480 
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ASAVREIVWS DAELIAAGGT LTPLSTTQFN LVLPPYKRTA QVSRVTDDLT ANFYSLSALA 
490 500 510 520 530 540 

I I I I I I 

VDHQGNRSNS FTLSVTVQQP QLTLTAAVIG DGAPANGKTA ITVEFTVADF EGKPLAGQEV 
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SSFTPNVSVN DQGQVTITYQ TYSEVAVTAK SKKFPSYSVS YRFYPNRWIY DGGRSLVSSL 

910 920 930 940 950 960 
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EASRQCQGSD MSAVLESSRA TNGTRAPDGT LWGEWGSLTA YSSDWQSGEY WVKKTSTDFE 
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L5: Entry 1 of 10 File: PGPB Dec 26, 2002 

PGPUB - DOCUMENT - NUMBER : 20020197276 
DOCUMENT- IDENTIFIER: US 20020197276 Al 

TITLE: Heterologous protection induced by immunization with invaplex vaccine 
PUBLICATION-DATE : December 26, 2 002 

US -CL- CURRENT: 424 / 203 . 1 
INT-CL: [07] A61 K 39/ 116 



L5: Entry 2 of 10 File: PGPB Jul 26, 2001 

PGPUB -DOCUMENT -NUMBER: 20010009957 
DOCUMENT- IDENTIFIER: US 20010009957 Al 

TITLE: Invaplex from gram negative bacteria, method of purification and methods of 
use 

PUBLICATION-DATE: July 26, 2 001 

US -CL- CURRENT: 530 / 395 ; 435 / 195 
INT-CL: [07] C12 N 9/14, C07 K 14/24 



L5: Entry 3 of 10 File: USPT Jan 20, 2 004 

US-PAT-NO: 6680374 

DOCUMENT- IDENTIFIER: US 6680374 B2 

TITLE: Invaplex from gram negative bacteria, method of purification and methods of 
use 

DATE-ISSUED: January 20, 2004 

US -CL- CURRENT: 530 / 388.1 ; 424 / 130.1 , 424 / 141.1 , 424/ 150.1 , 424 / 164.1 , 435 /329, 
435/332, 435/340, 530/ 350 , 530 / 388.2 , 530 / 388.4 



INT-CL: [07] C07 K 16/00, C12 P 21/08 



L5: Entry 4 of 10 File: USPT Aug 21, 2001 

US-PAT-NO: 6277379 

DOCUMENT- IDENTIFIER: US 6277379 Bl 

TITLE: Use of purified invaplex from gram negative bacteria as a vaccine 
DATE-ISSUED: August 21, 2001 

US -CL- CURRENT: 424 / 197.11 ; 424 / 193.1 , 424 / 203.1 , 424 / 234.1 , 424 / 241.1 , 424 / 249.1 , 
424/ 252.1 , 424 / 258.1 , 435/975, 530/350 , 536 / 123.1 
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INT-CL: [07] A61 K 39/ 385 



L5: Entry 5 of 10 



File: USPT 



Jun 12, 2001 



US-PAT-NO: 6245892 

DOCUMENT- IDENTIFIER: US 6245892 Bl 

TITLE: Invaplex from gram negative bacteria, method of purification and methods of 
use 

DATE -ISSUED: June 12 , 2 001 

US -CL- CURRENT: 530/350; 424 / 282.1 , 435/7.2, 530/416 



INT-CL: [07] C07 K 14/00, C07 K 1/00, G01 N 33/53, A61 K 45/00 



L5: Entry 6 of 10 File: USPT Jul 4, 2000 

US-PAT-NO: 6083683 

DOCUMENT- IDENTIFIER: US 6083683 A 

TITLE: Methods for detecting shigella bacteria or antibodies to shigella bacteria 
with an immunoassay 

DATE -ISSUED: July 4, 2 00 0 

US -CL- CURRENT: 435 /4; 424 / 282.1 , 424 / 93.4 , 435 / 252 .1 , 435 /29, 435 /34, 435 /822, 
435 / 975 
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L5: Entry 7 of 10 File: DWPI Dec 3, 2002 

DERWENT ~ACC -NO : 2003-129359 
ABSTRACTED -PUB -NO: WO 2 0 02 94 19 OA 
COPYRIGHT 2 004 DERWENT INFORMATION LTD 

TITLE: Inducing in a subject a protective immune response against infection with a 
first invasive gram-negative bacteria by administering a composition comprising 
Invaplex 5 0 from a second heterologous invasive gram negative bacteria 

INT-CL (IPC) : A61 K 0/00, A61 K 39/02, A61 K 39/108, A61 K 39/ 112 , A61 K 39/ 116 , 
A61 K 39/385, C07 K 1/OjO, GOl N 33/53 
Derwent-CL (DC) : B04, D16 

CPI Codes: B04-B04C1; B04-C02V; B04-F10A; B04-F10A3; B04-F10A5; B04-N03; B14-A01A; 
B14-A01A3; B14-A01A5; B14-S11B; D05-H07; 



L5: Entry 8 of 10 File: DWPI Aug 25, 2 004 

DERWENT- ACC -NO: 2000-292989 
ABSTRACTED -PUB -NO: US 62 773 79B 
COPYRIGHT 2 004 DERWENT INFORMATION LTD 

TITLE: New vaccine for protection against infection with gram-negative bacteria 
comprising Invaplex of the bacteria to elicit protective antibodies 
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INT-CL (IPC) : A61 K 0/00, Ml K 39/02, A61 K 39/385, A61 P 31/04 
Derwent-CL (DC) : B04, D16 

CPI Codes: B04-B04M; B04-F10A; B04-G07; B11-C06; B14-A01A; B14-S11B; D05-H07; DOS- 
HI 1 / 



L5; Entry 9 of 10 File: DWPI Jan 20, 2004 

DERWENT-ACC-NO: 2000-292988 
ABSTRACTED -PUB -NO: US 6245892B 
COPYRIGHT 2 004 DERWENT INFORMATION LTD 

TITLE: New composition comprising isolated Invaplex of gram-negative bacteria 
comprising at least one invasin protein associated with LPS of the gram-negative 
bacteria 

INT-CL (IPC) : A61 K 0/00, A61 K 45/00, C07 K 1/00, C07 K 14/00, C07 K 14/24, C07 K 
16/00, C12 N 9/14, C12 P 21/08, G01 N 33/53 
Derwent-CL (DC) : B04, D16 

CPI Codes: B04-B04L; B04-B04M; B04-F01; B04-F10A; B04-N03; D05-H04; D05-H07; D05- 
H09; D05-H11; 



L5 : Entry 10 of 10 File: DWPI Apr 7, 2003 

DERWENT-ACC-NO: 1996-209840 
ABSTRACTED -PUB -NO: US 5679564A 
COPYRIGHT 2004 DERWENT INFORMATION LTD 

TITLE: Prodn. of enhanced antigenic enteric bacteria for use in e.g. vaccines - 
uses bile acids, bile salts, oxygen, nitrogen, carbon-di : oxide and divalent cation 
chelant 704283 B 

INT-CL (IPC) : A01 N 63/00, A61 K 39/00, A6JL K 39/02^, A61 K 39/106, A61 K 39/108, 
A61 K 39/38, A61 K 45/00, A61 p 1/00, A61 P 31/04, C12 N 0/00, C12 N 1/00, C12 N 
1/12, C12 N 1/20, C12 N l/38, C12 Q 1/00, C12 Q 1/02, C12 Q 1/04 » G01 N 33/53, G01 
N 33/531, G01 N 33/569, C12 N 1/20, C12 R 1:19, C12 N 1/20, C12 R 1:63, C12 N 1/20, 
C12 R 1:42, C12 N 1/20, C12 R 1:01, C12 N 1/20, C12 R 1:19, C12 N 1/20, C12 R 1:63, 
C12 N 1/20 , C12 R 1:42, C12 N 1/20, C12 R 1:01 ~ 
Derwent-CL (DC) : B04, C07 , D16 , S03 

CPI Codes: B04-F10A; C04-F10A; B04-G01; C04-G01; B10-B01B; C10-B01B; B11-C07A; Cll- 
C07A; B12-K04A; C12-K04A; B14-S11B; C14-S11B; D05-H04; D05-H07; D05-H09; 
EPI Codes: S03-E14H4; 
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